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Abstract. We develop a notion of containment for independent sets in hypergraphs. For 
every r-uniform hypergraph G, we find a relatively small collection C of vertex subsets, 
such that every independent set of G is contained within a member of C, and no member of 
C is large; the collection, which is in various respects optimal, reveals an underlying struc- 
ture to the independent sets. The containers offer a straightforward and unified approach 
to many combinatorial questions concerned (usually implicitly) with independence. 

With regard to colouring, it follows that many (including all simple) r-uniform hyper- 
graphs of average degree d have list chromatic number at least (l/(r — l) 2 + o(l)) log r d. 
For r = 2 this improves Alon's bound and is tight. For r > 3, previous bounds were weak 
but the present inequality is close to optimal. 

In the context of extremal graph theory, it follows that, for each ^-uniform hyper- 
graph H of order k, there is a collection C of ^-uniform hypergraphs of order n each with 
o(n k ) copies of H, such that every H-fiee ^-uniform hypergraph of order n is a subgraph 
of a hypergraph in C, and log \C\ < cr/" 1 /" 1 ^' logn where m(H) is a standard param- 
eter (there is a similar statement for induced subgraphs). This yields simple proofs of 
many hitherto difficult results: these include the number of //-free hypergraphs, sparsity 
theorems of Conlon-Gowers and Schacht, and the full KLR conjecture. 

Likewise, for systems of linear equations the containers supply, for example, bounds on 
the number of solution-free sets (including Sidon sets, for which we give both lower and 
upper bounds) and the existence of solutions in sparse random subsets. 

Balogh, Morris and Samotij have independently obtained related results. 



1. Introduction 

The notion of an independent set plays a fundamental role in the study of hypergraphs. 
An r-uniform hypergraph, or r-graph, G is a pair (V(G),E(G)) comprising two sets, the 
vertices V(G) and edges E(G) of G, where each edge e G E(G) is a set of r elements 
of V(G). Hence a 2-graph is an ordinary graph. A set / C V(G) is independent if there is 
no edge e 6 E(G) with e C I. 

Whilst there are many theorems in the literature that can be phrased in terms of es- 
timating the number of independent sets in certain hypergraphs (we shall mention some 
of these shortly), the question per se of how many independent sets there can be in a 
graph has attracted attention only relatively lately. The maximum number of independent 
sets in a graph of given average degree can be determined easily via the Kruskal-Katona 
theorem [33\ 127] . but for regular graphs the maximum is harder to find: following a good 
estimate by Alon [I], the exact value for bipartite graphs was determined by Kahn [26j via 
an elegant entropy argument, and his result was extended to all graphs by Zhao |53j . There 
are at most (2 d+1 — l) n / M = 2 n / 2 +°( n / d ) independent sets in a eZ-regular graph of order n 
(that is, having n vertices), and this number is attained by n/2d disjoint copies of -fQ,d- 
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It would be convenient for many purposes if there were at most 2°( n ) independent sets 
in an r-graph G of order n and average degree d, but examples like those just cited show 
this hope to be a forlorn one. Nevertheless, for the applications we have in mind, it is 
enough to find a good collection C of containers for independent sets: this is a family of 
subsets of V(G) such that, for each independent set /, there is a set C G C with I C C, 
and \C\ = 2°( n ). Of course, we could just take C = {V(G)}, but this collection would not 
be helpful: for C to be of use, a further condition is needed that each container C G C is 
not large, in a sense made precise later. 

Another immediate candidate for C is the collection of maximal independent sets, but 
this too can be large; for example, if d is even, adding a 1-factor into the vertex classes 
of each K^d of the graph {n/2d)K^d produces a (d + l)-regular graph with at least 2 n / 4 
maximal independent sets. (The maximum number of maximal independent sets in any 
graph of order n was determined by Moon and Moser |36j.) 

The main purpose of this paper is to show that an r-graph G of average degree d and 
order n does have a small collection C of containers. Typically, but not always, \C\ = 
2 n / dl/ r 1+o(1)) _ Results of this kind were known previously in special cases. Sapozhenko |43t 
|4"4"1 |4"6| 05] treated regular 2-graphs. Containers for r-graphs were introduced and used 
in |47] in the case of simple regular r-graphs (a hypergraph is simple or linear if every pair 
of vertices lies in at most one edge). However, the most interesting applications require 
containers for non-regular r-graphs. Finding such containers presents significant difficulties 
and the method here is unrelated to that of |47| . 

We describe our main results about containers in $21 The fundamental result is Theo- 
rem [2J2 stated in §2.31 It is worth mentioning that the statement applies to all r-graphs G 
but it gives useful information only if d is large (though independently of n). Following ^21 
the containers are constructed in $3]and their properties analysed in 21 where Theorem 12.51 
is proved. Some corollaries, more amenable to application, are developed in $5]and and 
in 33 we give an example to show that the main result is, in a sense, best possible. Finally, 
in ^8^1121 we give details of some applications. 

Before getting down to details, though, we outline those applications to be discussed. 
Each of them involves, implicitly, dealing with the independent sets in some hypergraph G, 
and in each case it suffices to find an appropriate set of containers. Once some simple 
parameter of G has been calculated, the existence of the desired containers follows straight 
away from the main theorem or one of its consequences in and the applications are 

finished off by routine arguments (plus, in §101 a non-trivial removal lemma). The results 
obtained are generally best possible. 

Balogh, Morris and Samotij [6j have independently developed a container theorem akin 
to Corollary 12.71 together with applications, including a proof of Theorem 112.21 for balanced 
graphs. 

1.1. A little notation. We use standard notation. In particular, for m,n G N we let 

[n] = {1, . . . , n} and [m, n] = {m, . . . , n}. For collections of subsets we write, for example, 
[m,n]( s ) = {a C [m, n] : \o~\ = s}, [m, n]( >s ^ = {a C [m, n] : |<r| > s}, and so on. As 
usual, V[n] denotes the collection of all subsets of [n]. If G is a hypergraph we write 
e{G) = \E{G)\ for the number of edges of G and v{G) = \V(G)\ for the number of vertices 
of G. If S C V{G) then G[S] denotes the subhypergraph of G induced by S, that is, 
G[S] = (S,E{G)r\VS). 
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1.2. List colourings. A 2-graph G is said to be k-choosable if, whenever for each vertex 
v £ V(G) we assign a list L v of k colours to v, then it is possible to choose a colour for v 
from the list L v , so that no two adjacent vertices receive the same colour. The list chromatic 
number xi{G) (also called the choice number) is the smallest k such that G is /c-choosable. If 
all the lists are the same then a list colouring is just an ordinary ^-colouring and so Xi(G) 
is at least x(G), the ordinary chromatic number of G. This natural definition was first 
studied by Vizing [52] and by Erdos, Rubin and Taylor [19]. One of the main discoveries 
of [19] is that Xi{G) can be much larger than x{G), because Xl(^d,d) = (1 + o(l)) log 2 d, 
whereas x{ K d,d) = 2. 

In fact, unlike x(G), Xi(G) must grow with the minimum degree of the graph G. Alon [2, 
j3] showed that Xi{G) > (1/2 + o(l)) log 2 d holds for any graph G of minimum degree d. 

There is a straightforward reason, as pointed out by Alon and Kostochka [4J, why the 
same is not true for r-graphs if r > 3. Let F be some graph on n vertices, say F = (n/2)K2, 
and let G be some r-graph each of whose edges contains an edge of F. Then xi(G) = Xl(F), 
so in this example Xi(G) = 2, whereas the average degree of G can be large. However, if we 
restrict to simple r-graphs the situation is different. Following work of Haxell and Pei [22J 
on Steiner systems, Haxell and Verstraete [23] proved that xi(G) > (log dj 5 log log d) ' 
for simple d-regular 3-graphs G. Alon and Kostochka [4J showed Xi(G) > (log c?) 1 /^ -1 ) for 
simple d-regular r-graphs G, and in [17] this was improved to Xi(G) = O(logd). 

We extend this to all simple r-graphs, at the same time giving a better constant. 

Theorem 1.1. Let r E N be fixed. Let G be a simple r-graph with average degree d. Then, 
as d — )■ oo, 

XliG) > (l + o(l)) J 1 log r d 
holds. Moreover, if G is regular then 

Xi(G) > (l + o(l))^-log r d. 

r — 1 

Note that, for r = 2, this improves Alon's bound [3] by a factor of 2 and is best possible. 
We think that the bound given for regular r-graphs might hold for general r-graphs and 
moreover that it too might be best possible (see 

Theorem 11.11 is a weaker version of Corollary 18.21 which gives a bound for some non- 
simple r-graphs also. We do not give a general bound for all r-graphs because it would 
be rather complicated to state: however, in any particular instance, a bound can readily 
be derived from the results in £JHJ This would cover, for example the theorem of Alon and 
Kostochka [5] that if at least half the (r — l)-tuples of vertices of G lie in at least m edges 
then Xi{G) > c r logm. 

1.3. H-iree graphs. An ^-graph on vertex set [N] is said to be Li-free if it contains no 
subgraph isomorphic to the £-graph H. 

As far as H-iree graphs are concerned, our main result is this: for any given £-graph H, 
though there are many H-free ^-graphs, each of these is contained in one of a very few l- 
graphs that are almost H-iree. More exactly, there is a small collection C of ^-graphs, each 
H-iree ^-graph being a subgraph of an £-graph in C, and no £-graph in C having more than 
o(N v ( H ^) copies of H. The main content of the theorem is that the number of containers 
is very small. For graphs at least, Szemeredi's regularity lemma gives a set of containers 
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with log I C | = o(N 2 ), but the size of C in the theorem is much smaller. It is expressed in 
terms of a parameter m{H) that appears often in the literature. 

Definition 1.2. For an £-graph H with e{H) > 2, let 

e(H') - 1 



m(H) = max 



H'<ZH,e(H')>\ v(H') -I' 

Sometimes, H is called (strictly) balanced if the maximum is attained (uniquely) when 
H' = H. However, this restriction is not needed in any of our arguments and it is ignored. 
We then obtain the following, where n(H) = lmr/v^oo ex(N, -fO(^) 

Theorem 1.3. Let H be an l-graph with e(H) > 2 and let e > 0. For some c > and for 
N sufficiently large, there exists a collection C of l-graphs on vertex set [N] such that 

(a) for every H-free l-graph I on vertex set [N] there exists C E C with I C C, 

(b) moreover, for every pair I,C in (a), there exists T = (T\, . . . ,T S ) where T\ C I, 
s<c and J2i \T t \ < ciV^ 1 /"^ such that C = C{T), 

(c) for every l-graph C £ C, the number of copies of H in C is at most eN v ( H \ and 
e(C)<(n(H) + e)( N e ), ' 

(d) log |C| < eiV^~ 1 / m ( ff ) log N . 

The condition (b) just says that C is determined by T, which is comprised of small 
subsets of V(G) (actually subsets of I — this sometimes matters, because it means (b) is 
stronger than (d)). 

The existence of C follows straightforwardly from the results in £J21 a s shown in §21 by 
considering the e(.£f)-graph G = G(N,H), whose n = (T) vertices are the £-sets in [N], 
and whose edges are subsets of V(G) spanning a copy of H in [N]. The subsets of V(G) are 
then ^-graphs with vertex set [N], independent sets in G corresponding to H-free ^-graphs. 

One corollary of Theorem 11.31 is the next one. In the case £ = 2, it was proved for 
complete H by Erdos, Kleitman and Rothschild [T7] and for general H by Erdos, Frankl and 
Rodl [16J. Nagle, Rodl and Schacht [37] proved it for general I using hypergraph regularity 
methods. The proof here follows quite easily from Theorem 11.31 using the supersaturation 
theorem of Erdos and Simonovits j47j : details are in §9) 

Corollary 1.4. Let H be an l-graph. The number of H-free l-graphs on vertex set [N] is 

2 (n(H)+o(l))( N e ) ^ 

For ^-graphs H which satisfy ex(iV, H) = o(N £ ) (when 1 = 2 this means H is bipartite), 
we have tt(H) = 0, and Corollary 11.41 is unhelpful. Nevertheless our results can still be 
useful, provided appropriate information about G(N, H) is available. The simplest case 
is I = 2 and H = K 2)2 = C 4 , where it is well known that ex(N,C 4 ) = (1/2 + o(l))iV 3 / 2 
(Erdos, Renyi and Sos |18j). implying the trivial upper bound 2°( Ar3/ ' 2 lo s N ) f or the number 
of C4-free graphs. A direct application of our results readily gives the bound 2°( Ar3/2 ), but 
we don't give details because Kleitman and Winston |28] obtained a finer bound, namely 
2(i-082+o(i))A r3 2 ^ More generally, the number of i^ Si t-free graphs has been well estimated 
by Balogh and Samotij [7j. 

Alongside the many results about H-free graphs, there is a corresponding corpus about 
induced H-free graphs, that is, graphs with no induced subgraph isomorphic to H. The 
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number of induced if- free graphs was closely estimated by Promel and Steger [38J, and 
there have been many subsequent refinements. 

If I is an induced H-free £-graph, we need to ask what kind of object C must be in order 
that the inclusion / C C is helpful; if C itself is just an £-graph and I C C means I is a 
subgraph of C, as in Theorem 1 1.3\ then the induced subgraphs of I differ from those of C, 
which is no use. We borrow the notion of 2-coloured multigraph from |34}I51]. A 2-coloured 
^-multigraph C on vertex set [N] is a pair of edge sets Cr,Cb C \Ny® , which we call the 
red and the blue edge sets. Let / be an i'-graph on [N]. Then we write I C C if E(I) C Gr 
and [iV]W \ E{I) C C B - Thus edges in C R n C B do not affect whether I C C. 

Theorem 1.5. Let H be an l-graph and let e > 0. For some c > and for N sufficiently 
large, there exists a collection C of 2-coloured £-multigraphs on vertex set [N] such that 

(a) for every l-graph I on vertex set [N] with no induced copy of H there exists C £C 
with I C C , 

(b) for every C G C, the number of copies of H in C is at most eN v ( H \ 

(c) lo S \C\<cN £ -^M( V( f ) nio g N. 

This theorem can be used to recover basic results, akin to CoroHarv ll.41 about the number 
of induced fZ-free ^-graphs. In fact we can state a probabilistic version just as readily. 
Let G^'(N,p) be a random £-graph obtained by choosing edges independently from the 
complete ^-graph with probability p. The next statement involves a function h p (H), 
whose definition is natural enough but which is deferred until <j9l 

Theorem 1.6. Let < p < 1 and let H be an (.-graph. Then 

P(G (e) (N,p) is induced-H-free) = 2 -(M^)+°(i))( i D . 

For graphs, that is, £ = 2, this theorem was proved for p = 1/2 by Promel and Steger [381 
Theorem 1.3] and for general p by Bollobas and Thomason [HJ Theorem 1.1] (as illuminated 
by Marchant and Thomason [35] ) . For general £ it was proved for p = 1/2 by Dotson and 
Nagle [13], using hypergraph regularity techniques. 

It will be clear that similar arguments to those described in this section can be used to 
obtain container results about other structures, such as tournaments. 

1.4. Linear equations. Let F be either a finite field or the set of integers [N] . We consider 
linear systems of equations Ax = b, where A is a k x r matrix with entries in F, x £ F r 
and b G F k . We call such a triple (F, A,b) a k x r linear system. 

Definition 1.7. For a/cxr linear system (F, A, b), a subset / C F is solution-free if there 
is no x G I r with Ax = b, and ex(i ? , A, b) is the maximum size of a solution-free subset. 

The notion of a solution-free subset is analogous to that of an H-free hypergraph in 
the previous section. Once again, our contribution to this topic is a container theorem for 
solution- free sets. The statement (which extends to equations over abelian groups) is given 
in Theorem HU.2t it requires a few technical definitions so we omit it from the introduction. 

Nevertheless we mention a consequence for counting solution-free subsets. For an equa- 
tion Ax = b, how many solution-free subsets of F are there? A well-known instance of this 
question is to find the number of subsets S C [N] containing no solution to x + y = z; the 
asymptotic answer, conjectured by Cameron and Erdos [UJ, was given by Green [24j and 
by Sapozhenko [46j . 
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For a general system, every subset of a solution-free set is itself solution-free, so there are 
at least 2 ex ( F,A ' b ) solution-free sets. For a single equation (the case k = 1), it was shown 
by Green [25] that there are at most 2 ex ( F,A ' b ) + °(' F ') solution-free subsets. The same bound 
does not always hold for k > 2. If some variables are closely tied to other variables - 
say the equations imply that x = y — then there can be significantly more than 2 ex ( F,A ' b ) 
solution-free sets. However, a natural condition on A rules out closely tied variables, and 
in this case Green's bound holds good. 

Definition 1.8. We say that A has full rank if given any b 6 F k there exists x £ F r with 
Ax = b. We then say that A is abundant if it has full rank and every k x (r — 2) submatrix 
obtained by removing a pair of columns from A still has full rank. 

Theorem 1.9. Let (F,A,b) be a k x r linear system with A abundant. Then the number 
of solution- free subsets of F is 2 ex ( FiA ' b ) + °(l F D. Here o(l) — > as \F\ — > oo, with A fixed in 
the case F = [N]. 

For example, take A = (1,1,-1) and b = (0). Theorem 11.91 savs that the number of 
sum- free subsets of [N] is 2 N / 2+o(N \ giving a new proof of the weak form of the Cameron 
and Erdos conjecture, proved independently by Alon pQ, by Calkin |ll)| and by Erdos and 
Granville (unpublished) . 

Similar results hold when F is an abelian group. For the proof of Theorem 11.91 we need 
a result for equations analogous to the supersaturation theorem for graphs: this is the 
removal lemma of Krai', Serra and Vena |31| 132] . More information is given in §101 

1.5. Sidon sets. For linear systems where ex(F, A, b) = o(\F\), Theorem 11.91 is uninfor- 
mative. One of the most prominent examples is that of Sidon sets. A set A C [n] is Sidon 
if every sum of two elements is distinct, i.e., there are no solutions to w + x = y + z with 
{w, x} ^ {y, z}. It is easy to see that a Sidon set has size at most [\/2rel , since each of the 
|5| (|5| — l)/2 values x — y, where x,y G S and y < x, are distinct and lie in {1, . . . , n — 1}. 
Erdos and Turan [21] improved this upper bound to IS"! < (1 + o(l))y / n, and there are 
examples achieving this bound. 

It is natural to ask, as Cameron and Erdos did [11], how many Sidon sets there are, 
and the answer clearly lies between 2( 1+ °( 1 ^ v/ ™ and 2°(^/™ logn ). Neither of these bounds, it 
turns out, is tight. 

Theorem 1.10. There are between 2( L16 +°( 1 ))v / « and 2( 55 +°( 1 ))v / ™ Sidon subsets of[n). 

The lower bound gives a negative answer to the open question of whether there are 
only 2( 1+ °( 1 »v / " Sidon sets. The upper bound, also proved by Kohayakawa, Lee, Rodl and 
Samotij [29] (in fact with a better constant), follows directly by plugging in the appropriate 
numbers into a container-counting theorem. For details see §111 

1.6. Sparsity. In recent times, there has been interest in the extent to which theorems 
holding for dense structures hold also for sparse random substructures. Our results can 
be applied in this context, and we give some illustrative examples involving the notions of 
H-free graphs and solution-free subsets already discussed. 

The application of our results always fits a simple paradigm. Typically we want some 
statement to hold for a random substructure, with high probability; by considering an 
appropriate collection of containers, the fact that there is a small number of containers 
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means that the work is reduced, via the union bound, to establishing a (generally much 
simpler) statement for a single container. 

For example, consider a random ^-graph G^(N,p), as defined in §1.31 Evidently there 
are H-fiee subgraphs of G^'(N,p) with pex(N, H) edges, but are there significantly larger 
H-iiee subgraphs? Kohayakawa, Luczak and Rodl [30] conjectured that if p > cN^ l / m<yH ^ 
then H-iree subgraphs of G^(N,p) almost surely have at most (1 + o(l))pex(iV, H) edges. 
This conjecture was recently proved by Conlon and Gowers [12] (for strictly balanced H) 
and by Schacht [38], using different methods. Our methods give an alternative proof. For 
each container C 6 C given by Theorem ll.3[ it is easily seen that with high probability 
G^(N,p) contains not much more than pe(C) < (ir(H) + o(l))p( N e ) edges of C, and by 
the union bound this holds for all C £ C, and hence also for all H-iree ^-graphs. 

Theorem 1.11. Let H be an t-graph and let < 7 < 1. For some c > 0, for N sufficiently 
large and for p > ciV -1 /" 1 ^, the following event holds with probability greater than 1 — 
exp{-7 3 p(^)/512}: 

every H-free subgraph ofG^(N,p) has at most (tt(H) +^)p(^) edges. 

Other related conjectures, including what has become known as the KLR conjecture, 
were made in [30], not all of which have previously been proved in full, but they all follow 
from Theorem 11.31 in a similar way. Details are in §121 

The same arguments can be applied to solution sets of linear equations. Here is a typical 
consequence. 

Theorem 1.12 (Conlon and Gowers [12], Schacht [IE]). Let I > 3 and e > 0. There exists 
a constant c > such that for p > cN~ l l^~ l \ if X C [N] is a random subset chosen with 
probability p, then with probability tending to 1 as N — >• 00, any subset of X of size e\X\ 
contains an arithmetic progression of length i. 

Further examples and details can be found in §121 

2. Containers 

A couple of simple notions are needed for the statement of the main theorem, and we 
define these now. They are the co-degree function and degree measure. 

2.1. The co-degree function 5(G,t). The present results about containers were origi- 
nally motivated by the study of the list chromatic number of simple hypergraphs, described 
in §21 The main difficulties in the construction of containers are already present in the sim- 
ple case. However the method can be adapted efficiently to any hypergraph. The size and 
number of the containers depends on the way the edges overlap, but the dependence can 
be encapsulated by a single parameter, which is usually quite straightforward to compute. 
This parameter appears in most of the theorems. 

First, we define the degree of a subset of vertices, in the natural way. 

Definition 2.1. The degree of set of vertices a C V(G) is the number of edges containing 
<t; that is, 

d(a) = \{eeE(G) : a C e}\ . 
If |<t| = 1, that is a = {v} where v £ V(G), we generally write d(v) instead of d({v}). 



8 



DAVID SAXTON AND ANDREW THOMASON 



We can now define the co-degree function 5(G,t). This is a function of the parameter 
r, a parameter used in the construction of containers. 

Definition 2.2. Let G be an r-graph of order n and average degree d. Let r > 0. Given 
v e V(G) and 2<j<r, let 

d ij) (v) = max{ d{a) : v e a C |<r| = j } ■ 

If d > we define <5j by the equation 

bj T i- x nd = d {j \v). 

V 

Then the co-degree function 5(G, r) is defined by 

r 

6(G,t) = 2®~ 1 J2 2-( j 2 1 )s j . 
i=2 

If d = we define <5(G, r) = 0. 

There is nothing significant about the powers of 2 in the definition; they are just constants 
needed for Lemma 14.21 

Remark 2.3. The parameter r appears in the main theorem, Theorem 1 2. 5 \ and the smaller 
that r can be made the stronger the result becomes. The constraint on r comes from a 
lower bound on 5(G, r). It can be seen that if r decreases then the values of the Sj increase, 
and hence so does S(G,r); indeed S(G,t) — > as r — > oo. A typical application will have 
r as small as possible subject to 8(G,t) being less than some constant, say 1/r!. 

Here are some observations intended to indicate the optimal size of r. Observe first that, 
unless G has isolated vertices, d^\v) > 1 holds for all v, and so 5j > r 1_J jd. The largest 
of these bounds is 5 r > t 1 ~ t jd (r is invariably less than one) and so, for fixed r and large d, 
it will always be that for 5(G, r) to be small we must choose r at least as large as d~ 1 ^ r ~ 1 \ 

In a simple hypergraph, d(o~) < 1 holds whenever |<r| > 2, and so Sj < t 1- - 7 jd. In this 
case the largest of the <5j's is 5 r , and we can make 6(G,t) small by choosing r just a little 
larger than d~ l ^ r ~ l \ In fact, for any hypergraph whose edges are uniformly distributed, 
6 r is once again the Sj which dominates, as a simple calculation (which we omit) shows, so 
here again S(G, r) is small if r is larger than d —1 /( r ~l). 

Sometimes, though, the dominant Sj is not S r . One example of this is in the case of Sidon 
sets: when \S\ < n 2 / 3 it is the value of 82 which is the most important. Another example 
is the hypergraph describing H-iree ^-graphs: here the most important Sj is determined by 
which subgraph H' C H achieves the maximum of (e(H') — l)/(v(H') —£), and this is how 
m(H) enters in. But in each of our examples the values are easily checked. 

In summary, we must always choose r > d^ l ^ r ~ l \ and for simple or uniformly dis- 
tributed hypergraphs the value need not be much larger. But there are applications which 
are far from uniformly distributed, where r needs to be larger and where the behaviour of 
S(G,t) will prove crucial. 

2.2. Degree measure. We mentioned in the introduction that the containers must not 
be too large. For a substantial number of applications it suffices that \C\ < (1 — c)\G\ for 
some constant c. This is achievable for regular hypergraphs but it clearly is unattainable 
in general; for example, if G = K^ n -d (which, for large n, has average degree close to 2d) 
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then some container must have size at least n — d. Other applications require that the 
number of edges inside a container, that is, e(G[C]), is small. This is attainable in general 
but there are applications where what matters is \C\ rather than e(G[C]). 

In fact we measure the size of containers by what we call degree measure. It turns out 
that if the degree measure is bounded then it is possible to recover all the properties of 
containers that are needed. 

Definition 2.4. Let G be an r-graph of order n and average degree d. Let S C V(G). 
The degree measure n(S) of S is defined by 

Thus fi is a probability measure on V{G). Note that if G is regular then ^(S) = \S\/n, 
which is the uniform measure of S. 

2.3. The main theorem. The essential idea for demonstrating that a collection of con- 
tainers is small is this: each container is specified by just a small set of vertices, meaning 
that, given some small set T C V(G), there is a construction which produces another larger 
set C = C(T), and the construction is such that for any independent set / there is some 
T C I which produces a C with I C C. If T is small, the number of choices for T is not 
large, and so the number of containers is not large. Actually, we shall generate C from not 
just one small set but an r-tuple T = (T r _i, . . . , To) G VI of small sets; the principle is 
the same. 

We already introduced the parameter r. Essentially, r will be the value of (ji(T), which 
is why we want r to be as small as possible. In fact, the theorem guarantees //(Tj) < 2r/C 
where £ is some small constant at our disposal. Often we shall take £ = l/12r! but 
sometimes it is useful to choose a smaller value. 

We use one more piece of shorthand. Let T = (T r _i, . . . , T\ , To) G V [n] and let w G [n]. 
Then we define Tn [w] = (T r _i n [w], . . . , T n [to], T n H). 

Theorem 2.5. Let G be an r-graph with vertex set [n], where d(v) decreases with v. Let 
r, £ > satisfy 8{G,t) < C- Then there is a function C : V'\p\ — > V[n], such that, for 
every independent set I C [n] there exists T = (T r _x, . . . , To) G 7^/ with 

(a) I C C(T), 

(b) /j(T ),...,MT r -i) < 2r/C, 

(c) |T |, . . . , |T r _i| < 2rn/C 2 , and 

(d) /i(C(T)) < 1 - 1/r! + 4C + 2rr/C. 

Moreover C has the online property, meaning that C(T) n = C(Tn [w]) n [to] /or all 
T G P r [n] and u> G [n]. 

In /aci, the above is true for all sets I C [n] for which either G[I] is \j r ^ l C l e{G) /n\ - 
degenerate or e{G[I\) < 2rr r e(G)/C- 

Remark 2.6. It is worth making a few observations at this point. 

• Roughly speaking, the theorem says that for each / there exists T C I with /x(T) < 
r, I C C(T) and /u(C) < 1 — 1/r!, provided r is large enough to make 5(G, r) small. 

• The online property is needed only for certain applications, principally Theorem l2.8l 
More is said about this in §4.41 
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• Ordering the vertices by degrees is generally unnecessary: it is used only to ob- 
tain (c) and the online property simultaneously, and to accommodate 6-degenerate 
graphs. 

• The container construction method makes essentially no use of the independence 
of the sets /, so we include an extension to two kinds of sparse subset, where either 
G[I] is 6-degenerate or e(G[J]) < bn, and b is small. As usual, we say G[I] is b- 
degenerate if for every subset J C I there is a vertex in G[J] of degree at most b. 
This is equivalent to saying that every subgraph G[J] is sparse; indeed if G[I] is 
6-degenerate then G[J] has at most b\J\ edges for all J C I, and conversely if G[J] 
has at most b\J\ edges for all J C / then G[I] is rfr-degenerate. 

• The theorem is best possible in two senses. First, the examples in £J7] show that 
our method cannot give /i(C) < 1 — 1/r!. Secondly, the bound on the measure 
of the generating sets Tj, implicit in the constraint 8(G,t) < £ which determines 
how small r can be, can not be improved significantly. One way to see this is that 
an improvement here would give an improvement in some of the applications, say 
Theorem II .111 but these are known to be best possible. 

As mentioned in the introduction, Theorem 1 2 . 5 1 has a variety of consequences and weaker 
forms which are easier to apply directly. These are discussed in ^5]-fj6l but we state a couple 
of them here for illustration. 

2.4. Tight containers. In ^we give a number of consequences of Theorem 12 . 51 concerning 
the number of edges e(G[C]) inside the container. The following is the weakest of these; 
it provides, in a handy format, a collection of containers each with few internal edges, the 
size of the collection being bounded by a simple function of r. 

Corollary 2.7. Let G be an r-graph on vertex set [re]. Let < e < 1/2. Suppose that r 
satisfies 5(G,t) < e/12r! and r < l/144r! 2 r. Then there exists a constant c = c(r) and a 
collection C dV\p\ such that 

(a) for every independent set L there exists T = (T\, . . . , T s ) € V s L with L C C(T) £ C, 
\Ti\ < crn and s < clog(l/e), 

(b) e(G[C}) < ee(G) for all C GC, 

(c) log |C| < clog(l/e)nr log(l/r). 

Moreover, (a) holds for all sets L C [n] for which either G[L] is [eT r ~ 1 e(G)/12r\n\ - 
degenerate or e{G[L\) < 24er!rr r e(G). 

2.5. Uniformly bounded containers. Next we give a consequence for applications when 
the size |C| of the container is of interest. As mentioned earlier, examples such as K^^—d 
show that it is not possible always to guarantee that \C\ is bounded. However, it is possible 
to bound \C n [v]\ for some initial segment [v] of the vertex set [re], and this can be done 
so that the number of different sets |C Pi [v]\ is small (a function of v rather than of n). 
More exactly, each container C "nominates" v = g(C), so that \C n [v]\ is small and not 
many containers nominate any given v. This consequence of the main result is suitable 
for dealing with list colourings, for example. The statement is actually in terms of tuples 
(Ci, . . . ,Ct) of containers rather than individual containers, since this is ultimately more 
efficient. The theorem is explained more in £}6j 
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Theorem 2.8. Let G be an r-graph on vertex set [n], for which the degree sequence is 
decreasing. Let < £ < 1/12H. Suppose that 5(G,t) < £, that r < C, 2 jr, and that k 6 [n] 
satisfies fi([k]) < (/2r\. Let t G N. 

Then there exists a collection C C V[n] and a map g : C* — > [fc, n], wrai/i i/ie following 
properties: 

(a) /or independent sets I there is some C £ C with I C C , 

(b) for all v 6 [n] 

log |{(Cin [«],..., c t nM) : g {c x ,...,c t ) = v}\ < fWiog(i/T), 

(c) and for all (d,...,^) G C* 

1 - 1 

• 1 

«=i 

where v = g{C\ , . . . , Ct) . 

Moreover, (a) holds for all sets I C [n] /or which either G[I] is [T r ~ 1 (^e(G)/n\ -degenerate 
or e(G[I]) < 2rT r e(G)/(. 

3. The Algorithm 

In this section we describe the method of building containers and establish the basic 
facts about them. 

Sapozhenko |43 1 l44 l H5] gives a way to build containers in the case of ordinary (2-uniform) 
graphs that are close to being regular. However even for graphs of this kind there is more 
than one approach which works. It is not obvious how to extend these graph methods to 
hypergraphs, each method offering a few plausible possibilities, most of which fail but some 
of which succeed. Indeed we eventually found more than one way to construct containers 
for regular 3-graphs, and one of these, which extended to regular r-graphs, was described 
in |47j : it is a random-based method. But we have found it difficult to find a method which 
fulfils the goals of working for general (not just regular) r-graphs and which also has the 
necessary online property. The method of [37] would be enough for some applications, such 
as list colouring, if it were true that each r-graph of average degree d contained a subgraph 
with degrees in the range c\d a to C2d a , for some constants c\, C2 and a. But it is possible 
to construct examples of r-graphs where this is far from the truth (these examples bear 
some similarity to the 2-graphs of Pyber, Rodl and Szemeredi [39])- 

The method given below fulfils all our requirements. The reason for the remarks just 
made is that, whilst we do attempt to motivate the construction, we cannot do so fully 
because it is hard to explain why one construction should work when another similar 
one does not. The devil is sometimes in the detail, and a prolonged discussion would be 
unjustified. 

There are two aspects to the construction: the production of a suitable small set T 
from a set /, which we call pruning, and the production of C from T, which we call 
building. (In fact, as mentioned already, we produce not just one small set but an r-tuple 
T = (T r _i, . . . , To) G V r I of small sets.) These two aspects are very closely intertwined, 
and it is convenient to describe both in terms of a single algorithm which has two slightly 
differing modes, a build mode and a prune mode. 
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3.1. Constructing containers. Given an r-graph G with vertex set [n], and a sequence 
of subsets T r _i, T r _2, . . . , Ti of [n], we produce a sequence of s-multigraphs P s for s = 
r — 1, . . . , 1. This means that P s is s-uniform but multiple edges are allowed; in other 
words E(P S ) is a multiset. 

Each edge {u s -i,u s -2, . . . ,uq} G E(P s ) with u s _i < u s _2 < • • • < uo will come from 
an edge {iv-i, f r -2, • • • , v s ,it s _i, it s _2, • • • , uo} G E(G), where tv-i < • • • < w s < u s _i and 
Uj G Tj, r — 1 > j > s. Equivalently, each edge of P s is an edge of P s +i whose first vertex, 
which is in T s , has been removed. The reason P s is defined as a multigraph, even if G itself 
does not have multiple edges, is so that distinct edges of G give rise to distinct edges of P s . 

The multigraph Pi is 1-uniform: its edges are single vertices. If / is an independent set 
and the sets Tj are chosen within /, as they will be, then evidently the members of E{P\) 
cannot be vertices in I, and so the container C can be chosen from vertices not in E(P\). 
Our first aim, then, is to ensure that E(P\) is as large as possible, and to this end we 
attempt to make E{P S ) large for each s. However this aim has to be balanced against 
keeping the sets T s small. 

Hence we shall choose a parameter r, so that, roughly speaking, T s will comprise a 
proportion r of the vertices (in degree measure), and we might hope the size of E(P S ) to 
be roughly t times the size of E(P s+ i). This means the average degree of P s will typically 
be around T r ~ s d. The parameter t is the same as that discussed in £ j2.1l and the constraint 
r > dT l ^ r ^ 1 ^ described there is precisely what is needed to ensure that E(P\) contains 
something worthwhile. 

However not every edge of G with its first r — s vertices in T r -\, . . . , T s will be admitted 
as an edge of P s , but only a selection of these. We do not allow edges into P s if they 
increase the degree of some vertex, or the degree of some subset a C [n], beyond some 
agreed threshold. We define the degree of a in the multigraph P s to be 

d s (a) = \{e G E(P B ) : a C e}\ , 

where we note that here we are counting edges with multiplicity in the multiset E(P S ). 
(Naturally we may write d s (y) instead of d s ({v}) if v G [n].) There are several reasons for 
wanting to bound the degrees in P s . One reason is the hope of keeping the vertex degrees 
near to r r_s times the degrees in G, so that degree measure in P s relates to measure in G; 
in particular, small sets of vertices cannot account for most of the edges of P s unless those 
sets have large measure in G. A second reason for controlling degrees of subsets is that 
only by doing so can we restrain the degrees of vertices at later stages: this comes out in 
the proof of Lemma 14.21 

So we proceed in the following way. We begin with P r = G, and then apply the straight- 
forward algorithm below to construct P s from P s +\ using T s , with s taking the values 
r — 1, r — 2, . . . , 1 in turn. During the application of the algorithm, the degrees d s (a) in P s 
will grow, as edges are added. We denote by T s the collection of vertices and subsets whose 
degrees have reached their bound, and we do not permit the addition to P s of any edge 
which contains a current member of T s . The set T s will grow too during the construction. 

As mentioned before, the algorithm can be run in two modes: prune mode and build 
mode. The aim of successive runs of prune mode is to produce sets T r _i, . . . , T 1} and the 
aim of build mode is to produce sets C r _i, . . . , C\ which will form the basis of a container. 
Thus in a single run of prune mode, an independent set / is input and some set T s is 
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output, whereas in build mode a set T s is input and a set C s is output. Both forms of the 
algorithm include the construction of P s from P s +i. 

The essential difference between the modes is this. The algorithm inspects the vertices 
in [n] one by one. In prune mode, where I is part of the input, if the vertex is in / a 
decision is made, according to some rule, whether to place the vertex into T s . In build 
mode, where T s is input but / is not, the vertex is inspected to see whether it passes the 
rule; if it passes, but the vertex is not in T s , we know it could not have been in /, so it is 
removed from C s (with the aim of making a container via C s ). In both modes, if the vertex 
is in T s , appropriate edges are added to P s . 

Two real numbers are included in the input to the algorithm. The parameter r is the 
more important and has already been discussed. The parameter £ is a small constant, 
often in practice chosen to be l/12r! but sometimes chosen smaller. It is used in the rule 
to decide membership of T s . 

The independence of the set / is not actually used by the algorithm, and it is useful to 
define the algorithm for general subsets / C [n]. 

Algorithm 

input an r-graph G on vertex set [n] 

an (s + l)-multigraph P s +i on vertex set [n] 

parameters r, £ > 

in prune mode a subset I C [n] 

in build mode a subset T s C [n] 

output an s-multigraph P s on vertex set [n] 
in prune mode a subset T s C [n] 
in build mode a subset C s C [n] 

put E(P S ) = and T s = 
in prune mode put T s = 
in build mode put C s = [n] 

for v = 1, 2, . . . , n do: 

let F = {/ S [v + 1, n] W : {v} U / € E(P S+1 ), and Va G T s a £ f } 

[here F is a multiset with multiplicities inherited from E(P s +\)] 
in prune mode if \F\ > Qr r ~ s ~ l d{v) and v G J, add v to T s 
in build mode if \F\ > ( > T r ~ s ~ 1 d(v ), remove v from C s 
if v G T s then 

add F to £(P S ) 

for each u E [v + l,n], if d s (u) > T r ~ s d(u), add {n} to T s 
for each cr G [v + l,n]^ >:l ), if d s (o~) > 2 s Td s+ i(a), add a to T s 

The algorithm adds to P s all s-edges which, with v G T s as first vertex, form an edge of 
-P s +i and which do not contain (at that moment) any subset in T s . The degree threshold 
for a vertex entering T s is in terms of its degree d{u) in the original graph G, whereas for 
a larger subset a it is in terms of its degree in P s +i', this difference is for technical reasons 
arising in the proof of Theorem 12.51 
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Clearly, if in prune mode we construct T s from /, then, when in build mode with the 
same set T s as input, the condition \F\ > C i r r ~ s ~ l d{v) can happen only if either v I 
or v £ T s . Evidently, therefore, I C C s U T s , so C s U T s is one option for a container 
for /. Another option for a container, as mentioned earlier, is [re] minus the vertices 
in E(P\). In particular, if / is independent then I C [re] — T%. (Strictly speaking, the 
set ]?i is a set of singletons of vertices rather than a set of vertices, but we identify Ti 
with {u £ [re] : {u} £ Ti}. Furthermore Ti is not output by the algorithm, but it is 
easily recoverable from Pi. Indeed, P(Pi) is just a multiset of vertices, and d\{u) is the 
multiplicity of u in E(P\). By construction, T\ = {u £ [re] : di(u) > T r ~ 1 d(u)}.) So each 
of C s U T s , 1 < s < r — 1, and [n] —T± is a container for /; our aim is to ensure that at 
least one of these is a good container, meaning that is not close to [re]. 

Here then is a way of viewing the operation of the algorithm. If Ti is large then [n] — Ti 
is a good container for /. If Ti is not large then, since the degrees in Pi are bounded, the 
average degree of Pi must be small. But P r = G, whose average degree is not small, so 
there must be some s for which P,+i has large average degree (of order r r ~ s ~ 1 d) but P s 
has small average degree (much smaller than T r ~ s d). Since the degrees are bounded, there 
must have been plenty of vertices of P5+1 which could have contributed edges to E(P S ) 
but did not do so. Why did they not do so? Only because they are not in / and so not 
available for T s . These are exactly the vertices which are removed from C s : hence for this 
value of s, C S L)T S will be a good container for I. 

3.2. Properties of the construction. We are thus lead to two important definitions. 

Definition 3.1. Let G be an r-graph on vertex set [n] and let Id [re]. Let r, C, > 0. Let 
T r _i,...,Ti be the sets constructed by repeated applications of the algorithm in prune 
mode. Let Tq = I DTi. Then we define 

T(G,I,t,Q = (T r _i,...,Ti,r ) G V r I. 

The r-tuple T is the fruit of running the algorithm in prune mode, from which the 
container for / will be built. As noted earlier, if / is an independent set then To = InTi = 
0. Hence the introduction of To is unnecessary if we wish to find containers only for 
independent sets, but by introducing To we can produce containers for non-independent 
sets too. It will turn out that To is small, as desired, provided / is sparse; see 34.21 

Now comes the main definition — that of containers. 

Definition 3.2. Let G be an r-graph on vertex set [re] and let T = (T r _i, . . . ,Ti,To) G 
V'[n}. Let t, ( > 0. Let C r _i, . . . ,C% be constructed by repeated applications of the 
algorithm in build mode, using T r _i, . . . ,T\. Let Co = [n] — T\. The container C(G, T, r, () 
is then 

C(G,t,t,0 = (C r _ina-2n---nc* 1 nCo)uT r ._iUT r _ 2 ---uT 1 uT . 

Lemma 3.3. If T = T(G, I, r, C) then I C C(G, T, r, Q . 

Proof. We noted earlier that / C C s U T s for s > 0. Moreover, / C Co U To by definition, 
since C = [re] - Ti and T = InT 1 . Hence / C C(G, T, t,Q- □ 

Before computing the size of the containers C(G,T,t,Q and the number of them, we 
make note of their online property, namely that C(G,T,t,() H [w] is determined just by 

Tn [w]. 
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Lemma 3.4. Let G be an r -graph on vertex set [n] and let T G V r [n}. Then, for each 
we[n], C(G, T, t, C) n [w] = C(G, T n H , r, C) n [w] holds. 

Proof. A little reflection on the algorithm makes the lemma clear. Suppose that T s n [w] 
is given, together with the multiset of edges e S E{P s+ i) such that e fl [w] / 0. Then the 
algorithm, run with v = 1, . . . ,w rather than v = 1, . . . , n, will correctly produce the edges 
e £ E(P S ) such that efl [w] ^ and will find all a GT S such that <rn[w] ^ (together with 
some other members of E(P S ) and T s that we shall not need). Moreover, when running 
in build mode, this restricted version of the algorithm will correctly determine C s n [w]. 
Therefore, by running the restricted algorithm for s = r — 1, r — 2, . . . , 1, we can determine 
C r _i n [w], . . . ,C\ n [w]. Finally, since we have determined Fx D [w] we can find Co H [w], 
and because we are given Tq n [to] this means we know C n [w] . □ 



4. Container calculations 

In this section we estimate the measure of the tuples T(G, /, r, £) and of the containers 
C(G,T,r, (), thereby proving Theorem 12.51 



4.1. Degrees and co-degrees. Before making these estimates we need information on 
how large the degrees can be in P s . The intention behind the set T s is to prevent degrees 
being much larger than the target degrees, namely T r ~ s d{u) for the vertex u; after the 
degree of u attains this level, no further edges containing u are added to P s . However, 
when a vertex u enters T s , it does so because some multiset F has been added to E(P S ). 
Since F can include many edges that contain u, the degree d s (u) can increase significantly 
in one step, from an initial value at most the target value T r ~ s d{u) to something much 
larger. The extent of this problem depends ultimately on the way the edges of G overlap 
each other. 

The reason T s is defined the way it is in the algorithm, is to keep control of the degree 
problem without increasing r more than is necessary. This can be expressed succinctly in 
terms of the co-degree function S(G,t) introduced in §2. It 

First we need a small calculation. 

Lemma 4.1. For 2 < s < r and 2 < j < s, let al be given by the equations a/ = Sj 
and a,s = 2 s a^ x + for s < r, where Sj was defined in Definition \2.2L Then < 

4 2 ~ s 5(G,t) holds for s>2. 

(2) (2) (2) (2) 

Proof. Since a s > 2 s a^i > ^ a s +D ^ ^ s enough to prove that a 2 < S[G, r). Now by dint 

(i) 

of the definition it is clear that a; s is a linear combination of the numbers 5j+£, I > 0. We 

claim that the coefficient of Sj+i in a s is at most 2^' \ 2 )^ . This is certainly true if 
s = r, since the only positive coefficient is that of Sj (i.e. I = 0). For s < r we may prove 
the claim on the assumption that it is true for s + 1. If I = then the coefficient of Sj + £ 

in ai+V'' is zero, and the claim follows because 2(2) ~( t ) +e = 2 s 2(a)~( + 2 + If £ > 1 we 
have 
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and the claim follows in this case too. Hence the claim always holds, and so 

4 2) < 2©g 2 -('t 2 )+V* = 2©- 1 ^2-(V) 5j = S (G,r), 

£=0 j=2 

by definition of 5(G, r). □ 

Here is the main lemma about degrees in P s , explaining the role of the co-degree function 
S(G,r). 

Lemma 4.2. Let G be an r-graph on vertex set [n] with average degree d. Let P r = G and 
let P s -i, ...,Pi be the multigraphs constructed by successive applications of the algorithm. 
Then 

^2d s (u) < r r - s nd(fi{U) + 4}- s 5{G,t)) 
ueu 

holds for all subsets U C [n] and for 1 < s < r. 

Proof. By analogy with Definition 12.21 we define 

d^\u) = max{d s (cj) : u G a G [n] (j) } , 

for j > 2, where here it is the final values of these quantities that are used — that is, we 
measure these quantities in the output multigraph P s . 

When s = r the lemma is true by definition of n{U), so from now on we assume s < r — 1. 
Suppose that a G [n}^\ If a ^ T s then d s (a) < 2 s rd s+ i(a). If a G T s then a was added 
to T s after some vertex v G T s was inspected and F was added to E(P S ). Before this took 
place, d s (a) < 2 s Td s+ \{a) held; since the number of edges of F containing a was at most 
d s+ \{a U {v}), the final value of d s (cr) satisfies d s (a) < 2 s rd s+ i(a) + d s+ \{a U {v}). This 
inequality holds for all a G [n}^\ 

Let u G [n]; then ds(u) = d s (a) for some a G [n\^\ so 

(1) < 2 s rd s+1 (a) + d s+1 (a U {v}) < 2 s rd%{u) + d^ ] \u) . 
We claim that 

u£[n] 

where a { P was defined in Lemma 14.11 Indeed, for s = r the claim (with equality) is just 
the definition of 6j, and for s < r — 1 it follows immediately by induction (on r — s) from 

inequality (pQ) and the definition of afs\ Hence, for s > 1, we have by Lemma 14.11 

(2) Y, di 'U u )<^ S ^ Sn d6(G,r). 

u£[n] 

Now let u G U. If u ^ T s then d s (u) < T r ~ s d(u). If u G T s then u was added to T s 
after some vertex v G T s was inspected and F was added to E(P S ). Since i 7 has at most 
d s+ i({u,v}) edges containing u, the degree of u is at most T r ~ s d(u) + d s +i({u, v}). Now 
d s+ i({u,v}) < dg/^u) so, using ([2]), we have 

J2 d s{u) < ^r r - s d(w)+d2i(«) < r r - s n^(C/) + 4 1 - s r r - s nd5(G,r), 
«e(7 net/ 

which establishes the lemma. □ 
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4.2. The measure of the sets T s . We now estimate the measures of the sets T s . 

Lemma 4.3. Let I C [n] and T = T(G,I,t,() = (T r _i, . . . , T x , T ). Then fj,(T s ) < 
(t/C)(1 + S(G,t)) forl<s<r-l. 

Proof. The set T s is output when the algorithm is run in prune mode. During the run of 
the algorithm, each vertex v which enters T s contributes a set F of at least C,T r ~ s ~ l d{v) 
edges to E(P S ). Therefore, writing d for the average degree of G, Lemma 14.21 yields 

Qr r - s - l nd^{T s ) = ]T Qr r - s - l d{v) < e(P s ) < ]T d s (u) 

v&T s ue[n] 

<T r ~ s n ( i(l + 4 1 - s 5(G,T)) 
and this proves the lemma. □ 

The set To needs a different argument. As noted before, To = if / is independent. 

It turns out that we can allow / to be 6-degenerate, for fixed b, at essentially no cost 
compared with / being independent, though to handle this case we shall need to make sure 
the degree sequence of G is decreasing. On the other hand, requiring only that e{G[I\) < bn 
will incur a small cost, which is that r must increase typically from around c? -1 /^ -1 ) to 
around d~ l / r in order to keep To small (d being the average degree of G). 

Lemma 4.4. Let G be an r-graph on vertex set [n] with average degree d. Let L C [n] and 
T = T(G,I,t,C) = Cr r _i,...,Ti,r ). Ife(G[I\) < bn then fi(T ) < T l ' r d~ l b. If G[I] is 
b-degenerate and d(v) > r 1 ~ r rb for v G [m], then h(Tq n [m]) < (t/£)(1 + 5(G, t)). 

Proof. Recall that T = / n Ti, so d^v) > r^d^) for each v G T . Let J = T r _i U 
• • • U T\ U To C I. Recall that distinct 1-edges {v} G Ti correspond to distinct r-edges 
{f r _i, C / with tv_i < • • • < v\ < v and v s G T s , so these edges lie in G[J]. It 

follows that r^nd^To) < *£ veTo d^v) < e{G[J}). 

If e(G[I\) < bn then we simply have r r - 1 nd/i(T ) < e(G[J]) < e(G[I\) < bn, so fi(T ) < 
r 1_r d _1 6. 

If G[I] is 6-degenerate then we consider J* instead, where S* denotes S n [m] for any 
S C [n]. If v G Tq then each of the above-mentioned edges {w r _i, . . . , Ui, v} lies within J*, 
since w r _i < • • • < V\ < v. Therefore T T ~ 1 ndfi{TQ ) < e(G[J*]) < b\J*\, by ^-degeneracy. 
Then, by definition of [m], 

r r - l nd^)<b\r\ < -jA- E d ^ = ^ndn(J*). 

nSJ* 

Therefore r/x(T *) < /x(J*) < ^(T^J + • • • + /x(T*), and so (r - l)/i(T *) < ^T*^) + ■■■ + 
fx(T*) < /x(T r _i) + • • • + A*(Ti), from which the result follows via Lemma 14.31 □ 

4.3. The measure of the container C(G,T,t,Q. We now prove the crucial fact that 
the measure of the container C(G, T, r, £) is bounded above by some constant less than one. 
This can be established with a fairly simple argument, but just a little more care yields a 
bound close to 1 — 1/rl, and this is best possible, as shown in £J71 

Lemma 4.5. Let T = (T r _i, . . . ,T ) G V r [n]. Then 

r— 1 

MC(G, T, r, 0) < 1 - -j + T C + -5(G, r) + E ^(T s ) . 

s=0 
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Proof. Recall from Definition 13.21 that Co = [n] — T\ and that C r _i, . . . , C\ are constructed 
by the algorithm in build mode. Let C = C r _i Pi • • • D Cq. We define 

D 1 = [n]\C 

D 2 = {v£ [n] : {«} £T 2 ,v(£ D x } 

D 3 = {v G [n] : {v} £T 3 ,v(£ (D 1 U D 2 )} 

D r -i = {ve [n] : {v} G T r -i,v £ {D x U • • • U £> r _ 2 )} 
£> r = [n] \ (D x U • • • U A-i) • 

Evidently, Dx, . . . , D r form a partition of [n]. Since C(G, T,t,Q = C U T r _i • • • U To, it is 
enough to prove that n(Dx) > 1/r! - 11C/4 - 6(G,t)/4. 

It is convenient to define D <s = Dx U • • • U D s _x, D< s = D s U Z)< s and D >s = [n] \ D< s . 
For s > 2 we also need certain subsets of the edges of P s : 

x s = {feE(P s ) : |/ni>< s |>i, |/nr> >s |>2} 

Y s = { f G £(P S ) :/cD> s } 

= { / G Y s : a C / for some cr G r s _i, |<r| > 2 } . 

We further define the numbers x s ,y s ,z s by \X S \ = x s T r ~ s nd, \Y S \ = y s T r ~ s nd and \Z S \ = 
z s T r ~ s nd, where d is the average degree of G. Observe that X 2 = Z 2 = 0, that is, x 2 = 
z 2 = 0. 

Note that X s Pi Y s = and each member of E(P S ) \ Y s meets D <s . So 

d s (v) = \f nD °\ 

v&D a feE(P s ) 

/eB(F s )\(x s uy s ) /ga s /er s 

< ( a - i)|£(p s ) \ pf s u y s )| + (a - 3)|X S | + s |y 8 | 

= (s-l)\E(P s )\Y s \-2\X s \ + s\Y s \ 

<(s-l) d s (v)-2\X s \+s\Y s \ 

veD <s 

<(s- l)r r ~ s nd ( fi(D <a ) + A 1 ~ S S(G, r)) - 2|X S | + s\Y s \ , 

where the last line employs Lemma 1-4.21 

Now d s (v) > r r ~ s d(v) for all v G D s : for s < r this is because {v} G T s , and for s = r it 
holds trivially. Hence T r ~ s ndfj,(D s ) < Y1 V <=D d s (v), so we obtain (jl(D s ) < (s — l)((j,(D <s ) + 
4 1 ~ S 5(G, t)) — 2x s + sy s . Adding fj,(D <s ) = fj,(D< s -x) to each side gives 

p{D< a ) < s/i(L>< s _i) - 2x s + sy s + (s- 1)4 1 " S ( 5(G, r) 

for each s > 2. Multiplying this inequality by 1/s! and summing over s = 2, . . . ,r, noting 
that n(p< r ) = 1, D<x = Dx and x 2 = 0, we obtain 

s>3 ' s>2 V ; ' 

where we used J2 S >2^ 1 ~ S ( S — < 1/4. 
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Suppose s > 2 and / G Y s \ Z s . If c G and a C / then |cr] = 1, say <r = {u}. 

But {«} G implies u G D <s by definition of D s —i, which contradicts / G Y s . Thus / 

contains no member of r a _i. Let v be the first vertex of /. Then v £ D± since s > 2, so 
w G C C C s _x. By the construction of C s _i, v is the first vertex of fewer than (^T r ~ s d(v) 
edges of P s that contain no member of T s _i, so it is the first vertex of fewer than ( i T r ~ s d{v) 
edges in Y s \ Z s . Therefore \Y S \ — \Z S \ < Y2veD > C TV ~ s d(v) = (T r ~ s ndfi(D> s ). Hence 
Us — z s < CfJ-(D> s ) < C- In particular yi < £, because Z2 = 0. 

Now let s > 3 and put S = {a G : \a\ > 2, a C D> s }. By definition of Z s , each 
member of Z s contains a member of S. Therefore 

z s T r ~ s nd = \Z S \ < d s {a) 

- XI 2s _ 1 4-i(o") by definition of T s _i 

< — |{/ G ^(P s -i) : a C / for some a G 5}| 
r 

< - uy s _i| = (x s _a +y s _i)r r_s nd. 
r 

Hence z s < x s _i + y s _i for s > 3. Since y s < z s + £ this means y s < + + C; by 
repeating and applying both X2 = and y2 < C> tbis yields y s < x s _i + x s _2 + • • • + x% + 
(s — 1)C for s > 3. The inequality holds for s = 2 also. Substituting this inequality into 
inequality ([3]) we obtain 

i < + E «. U + £ i) + C E ^ + r) . 

S 3 \ J — S J S 2 

The coeffient of x s is negative, and so 1/r! < n{D\) + 11^/4 + 5(G, t)/4, which is what we 
needed to prove. □ 

4.4. Proof of Theorem 12.51 The simple and obvious choices for T and C in Theorem 12.51 
are T = T(G, I,t,() and C(T) = C(G,T,t,Q. Indeed this choice works perfectly well for 
parts (a), (b) and (d) in the (most important) case when / is independent, and also when 
e(Cr[I]) is bounded. However for the case when G[I] is 6-degenerate a different choice is 
required. We need containers determined by small sets of vertices none of which has low 
degree. 

The online property offers, as a by-product, a convenient way to achieve this aim, which 
furthermore has the side benefit of yielding (c). We label the vertex set so that the degrees 
are decreasing, and hence the vertices of small degree are specified by some terminal segment 
of [n]. We then run the algorithm just on the initial segment [m] of vertices of high 
degree, and add the trailing segment [m + 1, n] to the container. These modified containers, 
determined by subsets of [m], still have the online property. 

Lemma 4.6. Let G be an r -graph on vertex set [n] and let m G [n]. Let L C [n], let 
S = T(G,L,t,() and let T = Sn[m]. Put 

C = C(G,T,t,()U [m + l,n]. 

Then C is a container, determined by T, with the online property: that is, I C C, and 
C n [w] is determined by T n [w] for all w G [n]. 
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Proof. The lemma is a simple consequence of Lemma 13.41 □ 

Proof of Theorem \2.5[ Notice that the theorem is trivial if £ > l/4r!, since in that case 
the function C(T) = [n] works, with T = (0, . . . , 0) representing all I. Recall too from £ )2.1I 
that 5(G,t) — > as r — > oo. Hence the condition 5(G,t) < £ is satisfiable by making r 
large enough, although if r > C/^ r the theorem is again trivial. 

Let d be the average degree of G. We define m G [n] by [to] = {v G [n] : > £d}. 
Let 5 = T{G,I,t,() £ ^ r [n] and write 5 = (5 r _i, . . . , 5 ). We then put T = SO [to], 
so T = (T r _i, . . . ,To) where T s = S s n [m], < s < r — 1. Finally, we define C(T) = 
C(G,T,t,C) U [to + l,n]. We shall show that the theorem holds with this choice of T 
and C. 

First, note that (a) and the online property both hold by virtue of Lemma 14.61 To show 
that (b) holds, we apply Lemmas 14.31 and 14.41 notice that the sets denoted by T s in those 
lemmas are here denoted by S s . For s > 1 we obtain //(T s ) < fj,(S s ) < 2r/£ by Lemma I3~3l 
because, as remarked, may assume 5(G,t) < £ < l/4r!. As for To, if I is independent then 
To = 0. If G[T\ is 6-degenerate then Lemma |4~41 shows //(To) = /i(5oH [m]) < 2t/£, because 
d(v) > Qd > T 1 ~ r rb for v G [to]. If e(G[T\) < bn then Lemma 14.41 shows //(To) < //(So) < 
t r d~ l b < 2r/C- Hence (b) holds in every case. 

Since T s C [to] for each s, we have |T s |<^ci < X^eT = n d^{T s ), so (c) is a consequence 
of(b). 

Now //(C) < h(C(G,T,t,C)) + //([to+ l,n]). Lemma S3] applied to C(G,T,t,() shows 
fi(C(G,T,r,0) < l-l/r! + 3C+2rr/C, because (b) holds. Finally, note that //([m+1, n]) < 
C by definition of [to], so (d) holds and we are done. □ 

5. Tight containers 

The method of Theorem 12.51 will not, in general, give containers of measure less than 
1 — 1/r! (we shall give examples to show this in On a more positive note, it is possible 
via iteration to obtain smaller, almost optimal, containers, at the cost of sacrificing the 
online property. Some applications, most notably list colouring, depend crucially on the 
online property, but for many applications it is unnecessary, and in such cases it is generally 
well worthwhile using smaller, iterated, containers. 

Let G be an r-graph of order n and average degree d, and let / be an independent set 
in G. Suppose we have a container C for / with //(C) < 1 — c, such as that supplied by 
Theorem 12.51 where c is close to 1/r!. It follows that the induced subgraph G[C] cannot be 
dense; to be precise, 

(4) e(G[C]) < (1/r) £ d(v) = (l/r)nd^C) = //(C)e(G) < (1 - c)e(G) . 

Theorem 12.51 can now be re-applied, this time to the hypergraph G[C], to get a container 
C C C, with e{G[C']) < (1 - c)e{G[C}) < (1 - c) 2 e(G). Repeating this operation often 
enough will give a container with o(e(G)) edges. 

The reason that the online property of Theorem 12.51 is lost following iteration is that, 
in order to compute, say, C' n [v], it is necessary to know the whole of G[C], for which it 
is necessary to know the whole of C, not just C fl [v]; hence C' n [v] cannot be computed 
online. 

Another drawback of iteration is the increase in the number of sets T needed to specify 
the eventual container, leading to a greater number of possible containers. Usually in 
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practice this turns out to be unimportant. Here is a simple lemma to help count the 
number of containers being generated. 

Lemma 5.1. There are at most exp{s#n(l + log(l/0))} s-tuples of subsets T±, . . . ,T S C [n] 
with \T\ \ + . . . + \T S \ < sOn, where 9 < 1. 

Proof. Let there be Nj such s-tuples with |Ti| + . . . + \T S \ = j. We wish to bound N = 
Nq + N\ + . . . + -/V|^0 n j . Plainly, the ordinary generating function Nq + N\x + A^x 2 + . . . 
for the sequence (Nj) is equal to ((1 + x) n ) s . Therefore, since 9 < 1, we have 9 sdn N < 
(1 + 9) ns < e sdn . □ 

The next theorem is a version of Theorem 12.51 more suited to iteration. 

Theorem 5.2. Let G be an r-graph on vertex set [n]. Suppose that 5(G,t) < l/12r! and 
that t < l/144r! 2 r. Then there exists a collection C C V[n] such that 

(a) for every independent set I there exists T = (T r _i, • • • > Tq) E V r (I) with I C C(T) G 
C and \Ti\ < 288r! 2 rn, 

(b) log |C| < 288rr! 2 nrlog(l/r), and 

(c) e(G[C]) < (1 - l/2r!)e(G) /or all C e C. 

Moreover, (a) holds for all sets I C [n] /or which either G[I] is \r T ~ 1 e(G) /12r\n\ - 
degenerate or e(G[I}) < 2Ar\rr r e(G). 

Proof. We apply Theorem 12.51 to G with £ = l/12r! (we may assume the degree order 
of G is decreasing). For each set / we have T = (T r _i, . . . ,Tq) and a container C(T) 
satisfying properties (a)-(d) of that theorem. Take C to be the collection of all such C. 
Since r < C 2 / r ) we have 2rr/C < 2£, so //(C) < 1 — 1/r! + 6£ = 1 — l/2r!. It follows from 
inequality © that e(G[C]) < (1 - l/2r!)e(G). 

Hence (a) and (c) of the present theorem are satisfied and it remains to check (b). 
Theorem 12.51 tells us that each container C is specified by sets To, . . . ,T r _i each of size at 
most 9n, where 9 = 2t/( 2 = 288r! 2 r < 2/r < 1. By LemmaO we have 

log |C| < r0ra(l + log(l/0)) < r0n log(l/V) = 288rr! 2 nr log(l/r) , 

establishing (b) and completing the proof. □ 

Theorem 15.21 is easy to apply iteratively. Though a similar theorem can be derived using 
the proof method of Theorem 3.1 of [47J in place of Theorem 12. 5\ which actually gives a 
better bound in (c), with a polynomial in r in place of r!, the bound in (b) is worse, with 
r 1 / 2 in place of r. We remark, though, that it is not possible to iterate Theorem 3.1 of |47j 
directly, because it applies only to regular r-graphs, or, more precisely, to r-graphs with no 
large sparse subset. Even if G itself is regular, G[C] might be far from regular. 

The next theorem is the result of applying Theorem 15.21 repeatedly to end up with a 
collection of sparse containers. Its appearance is rather technical but it is in a form which 
can be readily applied. 
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Theorem 5.3. Let G be an r -graph on vertex set [n]. Let eo < e(G). Suppose that, for 
each U C [n] with e(G[U]) > eo, the function t(U) satisfies both 5(G[U],t) < l/12r! and 
t(U) < l/144r! 2 r. For e < m < e(G) define 

f{m) = max{-\U\r(U) log t(U) : U C [n], e(G[U]) > m] 
t* = max{ t(U) : U C [n], e(G[U]) > e } 

Let k = log(eo/e(G))/ log(l — l/2r!). XTien t/iere exists a collection C C "P[n] suc/i i/iai 

(a) for every independent set L there exists T = (Ti, . . . ,T S ) G V s (L) with L C C(T) 6 
C, |Ti| < 288r! 2 r*n and s < (fc + l)r, 

(b) e(G[C]) < e /or a// C G C, 

(c) log |C| < 288rr! 2 E <«< fc /(eo/(l " l/2r!) 4 ). 

Moreover, (a) holds for all L C [n] for which either G[L] is Lr(C/) r ~ 1 e(G[C/])/12rr!|[/|J - 
degenerate or e(G[I]) < 2Ar\rT{U) r e{G[U]), for all U C [n] with e(G[U]) > e . 

Proof. We will show that for all t with eo < t < e(G)/(l — l/2r!), there exists a collection 
Ct C V[n] satisfying conditions (a)-(c), where the constant eo has been replaced by t in 
(a)-(c), and k is replaced by k(t) = log(t/e(G))/log(l — l/2r!). 

When t > e(G), we may take Ct = {[n]}. Otherwise, suppose t < e(G). It is enough to 
show that Ct exists provided V = C i /n_ 1 / 2r n exists. Each D G V is specified by a tuple 
T' = (Ti,...,7V) withs' < (Jfe(t/(l-l/2r!)) + l)r = k{t)r. I£e(G[D]) <t,letC t (D) = {D}. 
Otherwise, apply Theorem 15.21 with r = r(D) < r* to the r-graph G[D], and let Ct(D) be 
the collection of containers given by the theorem. Then put Ct = y] D&v Ct{D). 

If C G C{D) then C is specified completely by T', together with the r-tuple appearing in 
condition (a) of Theorem 15.21 if the theorem was applied. Hence C is specified completely 
by a tuple of size at most (k(t) + l)r, so satisfying condition (a). If D G T> then either 
e(G[D]) < t in which case |Ct(D)| = 1, or e(G[D]) > t in which case 

log |C*(Z?)| < 288rr! 2 |L>|r(L>)log(l/T(L>)) < 288rr! 2 /(t). 

Hence 

log |C t | < log|2?| + 288rr! 2 /(t) < 288rr! 2 ^ f{t/{\ - l/2r!)*). 

0<i<fc(t) 

Finally for C G Cf(-D), note that e(G[C]) < t, since if e{G[D\) > t then by condition (c) of 
Theorem IQ1 e(G[C}) < (1 - l/2r!)e(G[L»]) < t. □ 

For certain applications the technical detail of Theorem 15.31 is not needed; what is re- 
quired is a simple statement that a few iterations will produce a container with a negligible 
proportion of the original edges. Such a statement was presented earlier as Corollary 12.71 

Proof of Corollary Let e = ee(G). Observe that for U C [n], if e(U) > ee(G) then 
5(G[U],t) < 5(G,r)/e < 1/12H. Therefore we may apply Theorem 15.31 to the graph G 
with eo = ee(G) and r(U) = r for all U. Then r* = r and f(m) = nrlog(l/r). Hence we 
obtain a collection C satisfying conditions (a) and (b) of the corollary, and 

log|C| < 288r! 2 r (l + ^^y^ ) nrlog(l/r), 
giving condition (c). □ 
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6. Uniformly bounded containers 

Theorem 12.51 provides containers of bounded degree measure, and £j5] provides containers 
with few edges inside. For some purposes, though, what is required is containers of bounded 
size, that is, of bounded uniform measure. 

For regular hypergraphs the results of $5] can be used directly. If G is (f-regular and 
C C [n] is such that e(G[C]) = o(nd) then \C\ < (1 — l/r + o(l))n. However for non-regular 
hypergraphs we get no such information. Indeed, it is perfectly possible to have n{C) and 
e(G[C]) both small whilst \C\ is close to n. Hence it is not possible to guarantee a container 
of bounded uniform measure. 

Instead, what we can show is that there is some initial interval [v] C [n] such that 
|Cn [v]\ is bounded. The basic lemma which translates information about /U-measure into 
information about uniform measure is the following one. In the lemma, S is a multiset, so 
fJ-(S), \S Ci [v]\ and so on have their natural interpretations counting with multiplicities. 

Lemma 6.1. Let /z : [n] — > R be a measure with /i(l) > fi(2) > ■ ■ ■ > //(n), and let S C [n] 
be a multiset. Then 

afi({v£[n] : \Sn[v]\ > av }) < n(S) 

holds for all a > 0. 

Proof. Let W = {v : |5n[w]| > av}. We must show afJ,(W) < fi(S). Let W = {wx, ■ ■ ■ ,w k } 
where k = \W\ and w\ < W2 < ■ ■ ■ < u>k- Define the numbers sx, ■ ■ ■ , Sk by sx = \S fl [wx]\ 
and Si = \Sn + 1, wi\\ for i > 2. Then we have fi(Sn [wi-x + l,Wi]) > Sifi(wi), because 
> fi(2) > ■ ■ ■ > /i(n). Therefore 

n(S) > fi(S n [wx]) + fi(S n K + 1, w 2 }) + • • ■ + //(5 n K-i + 1, w k ]) 

> sxn(wx) + s 2 n(w 2 ) -\ \-s k fJ>(w k ) 

k 

= an(wi) + (sx H h Si - ai)(ji(wi) - n(w i+1 )) , 

i=l 

(5) = a^(W) + > -I \- Si- ai)(fj,(wi) - fj,(w i+1 )) , 

i=i 

where [i{wk + i) is defined to be zero. Now 15 n = si + • • • + S{ holds for 1 < i < k, 
and so si + • • • + Sj > awi, because Wi S W. In particular, sx + • • • + Sj > m, since 
Wi > i. Moreover // is a decreasing function, so each summand in ([5]) is non-negative, and 
the lemma follows. □ 

In fact we shall need not just that \C Pi [v]\ is bounded for a single container C but that 
the average ^i=i l^i 1 " 1 HI is bounded for a collection C\, . . . ,Ct- The next lemma prepares 
the way. 

Lemma 6.2. Let \i be a probability measure on [n] with > /x(2) > • • • > /u(n) > 0. Let 
T\, . . . , T s , Cxi ■ ■ ■ ,Ct be subsets of [n], with fJ,(Ti) < A for 1 < i < s and n(Cj) < 1 — c — rj 
for 1 < j < t, where c, r/ > 0. Suppose moreover that k E [n] and //([&]) < r/c. T/ien there 
exists v G [fc, n] mi/i 

1 v— -v A 1 .* ^ 

-^|T» n [v]\ < -v and -^(Qn [v}\ < (1 - c)u . 

s i=i 77 * i=i 
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Proof. Let U = {v : Y^i=i ^ HI — sXv/rj}. Writing 5 for the multiset which is the 
disjoint union of Xi, . . . ,T S , so that n(S) < sX and \S Ci [v]\ = J2t=i l^i ^ HI' we can a PPly 
Lemma UTTl with a = sX/rj to obtain //(?/) < (jl{S)/ol < rj. 

In like manner, let W = {v: Y? i= i \Ci n H| > t(l - c)«}. Writing now iS for the multiset 
which is the disjoint union of Ci, ...,C t , so that //(S) < t(l-c-rj) and |5n[u]| = Y/ i= i \ c i n 
[v]\, we apply Lemma [6TT1 with a = t(l — c) to obtain /i(W) < t{l — c—rj)/a = 1 — rj/(l — c). 

It follows that /i(C/ UWU [k]) < rj + 1 — 77/(1 — c) + rjc < 1, so there exists t> G [n] not 
contained in U U W U [fc] . This v satisfies the conditions of the corollary. □ 

We can now prove the main result about containers and uniform measure, which was 
stated earlier as Theorem 12.81 

Proof of Theorem \2.8[ Apply Theorem [23] to G to obtain a collection C of containers C(T) 
for T = (T r _i,...,T ) E V r [n}. Since r < ( 2 /r we have 2rr/C < 2( so fi{C(T)) < 
1 - l/r! + 6C 

Let (Ci, . . . ,Ci) G C*, where t G N. Each Cj is specified by an r-tuple of sets Tj, so 
the whole collection (Ci,...,Cf) is specified by rt sets which, after re-labelling, we call 
Ti,...,T rt , with n(Ti) < 2r/C for 1 < i < rt. Let c = 1/r! - 8C and 77 = 2£. Then 
^(W) ^ < ??c so the conditions of Lemma 16.21 are satisfied with s = rt and A = 2r/C, 

and we may choose v G [k,n] with 

-J2\Tin[v]\<^v and - ^ |Ci n [v]\ < (1 - - + 8Qv . 

i=l ^ i=l 

Define ^(Ci, . . . , Ct) = u. Then (a) and (c) of the theorem are satisfied. 

To obtain (b), we need that the containers have the online property: in other words, the 
i-tuple {C\ n [v ],..., Ct n [v]) is determined by T\ n [u], . . . , T s n [w]. This online property 
is guaranteed by Theorem 12.51 Hence the size of the set Z = {(C\ D [v], . . . , Ct n [v]) : 
p(Ci, . . . , Ct) = w} is bounded by the number of tuples (Tin[v], . . . , T s n[u]). Now Ylt=i l^ 1 " 1 
[v]\ < s6v, where 9 = t/Q 2 < 1. So by Lemma 15. II 

log \Z\ < s9v{l + \og{\/9)) < s9vlog(l/r) = C~ 2 f trr log(l/V) , 

which completes the proof. □ 

7. An example of large containers 

Theorem 12.51 provides a small collection of containers for independent sets in an r-graph, 
each container having degree measure at most 1 — 1/r! (plus a term that is usually small). 
It is conceivable that an algorithm different to the one in $3] might yield smaller online 
containers. But in this section we describe examples to indicate that 1 — 1/r! is the limit 
of the present method. 

Indeed, for each r > 2, there are examples of simple r-uniform hypergraphs G and 
independent sets / for which /i(C(G, T, r, £)) can be as large as 1 — 1/r! + o(l) when 
T = T(G, I, r, £). Because a precise detailed definition of the examples would be very long 
and opaque, we give instead a sketch showing how the examples work, together with a few 
words about how the details can be filled in. 
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Figure 1. Approximate setup sufficient to produce a container of measure 
23/24 in a 4-graph G. 

7.1. Sketch for r = 4. Suppose G is a simple (i-regular 4-graph with the following proper- 
ties (see Figured]). Let the vertex set be AU Bill B 2 U B 3 U B±, where \Bi\ = 18k, \B 2 \ = 4fc, 
\B%\ = k, \B±\ = k, \A\ = o(k), so ix{B±) = 1/4! + o(l) and so on, where o(l) — > as d — > oo. 
Since G is regular, the degree ordering used by Theorem 12.51 is not determined by the 4- 
graph itself, so we may suppose we have an ordering with A < B\ < B 2 < Bs < B4. Let 
the induced subgraph of G on B\ U • • • U B4 be approximately (i-regular (so that only a 
few edges meeting B\ U • • • U B4 also meet A), and suppose every edge has 3 vertices in B\ 
and 1 vertex in B 2 U B3 U B4 (the sizes of the sets Bi have been chosen to ensure that this 
is possible). Suppose we run the algorithm with r = (i -1 / 3 (which is roughly the typical 
value of r for a (i-regular simple 4-graph). 

First the algorithm is run with s = 3. After the vertices in A have been inspected but 
before any vertex in B\ has been inspected, the partially constructed subgraph of P3 will 
contain 3-edges that depend on / and on the edges of G that contain a vertex of A. But 
suppose for now that we can choose I,G so that at this point, P3 contains a d 2 / 3 -regular 
graph on B 2 L)B^DB4, in which every edge has two vertices in B 2 and one vertex in B3UB4 
(see Figure [TJ . Then since T3 contains vertices with degree at least rd = d 2 ^ 3 , all vertices 
in B 2 U B% U B4 will be in Therefore, as the algorithm for P3 proceeds to inspect the 
vertices after A, all vertices in B\ will be kept in C3 (since, for each vertex v € B\, there are 
very few edges whose first vertex is v that do not meet T3). Furthermore, all the vertices 
in B 2 U -B3 will be kept in C3 since they are not the first vertex of any 4-edge in G. 

Similarly, if we can arrange things so that when the algorithm is run with s = 2, B3UB4 
contains a d 1//3 -regular bipartite graph, and when the algorithm is run with s = 1, B4 
contains a 1-regular 1-graph, then the sets B\ U B 2 U B3 will be kept in C 2 and C\. Also Y\ 
contains no singletons from B\ U B 2 U B 3 , so B\ U B 2 U B 3 C C, and so 11(C) > 23/24 + o(l) 
as required. 

This broad setup generalizes to all r by taking sets B\ , . . . , B r of size \B{\ = ( (r — i + 1) ! — 
(r — i)\)k, such that the induced graph of G on BiU- ■ - L)B r is approximately (i-regular with 
every edge having r — 1 vertices in B\ and 1 vertex in B 2 U • • • U B r . Then provided that the 
P s are similarly generalized, we will have BiU- ■ -l)B r -i C C and so n(C) > 1 — 1/r! + o(l). 

7.2. Priming the P s . In the example above, we did not specify how to guarantee that the 
graphs P s are as stated. This is achieved by adding edges to G containing vertices from A 
and specifying I C A appropriately; we indicate how this may be achieved for r = 4. 

Suppose \A\ = 2/c(f -1 / 3 . Choose an equipartition D\ U D 2 of B 2 , and consider the 4 
vertex classes A, D\, D 2 , B3 U B4. The last three classes each have size 2k. Add edges to 
G forming a simple 4-partite 4-graph between these classes, such that in this set of edges 
the vertices in A have degree d and the vertices in B 2 U B% U B4 have degree d 2 ^ 3 . Now 
let I = A. Consider what happens when the algorithm is run on this graph. Each vertex 
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v £ A will be added to T3 and the corresponding edges will be added to P3, producing the 
required G? 2 / 3 -regular graph on B 2 U B3 U B4. 

Suppose now that A = A 1 U A 2 U A 3 , \Ai\ = kd~ 2/3 , \A 2 \ = kd~ 1/3 . Consider the 4 
vertex classes A%, A 2 , B3, P4. Add edges forming a simple 4-partite 4-graph between these 
classes, such that in this set of edges the vertices in A± have degree d, the vertices in A 2 
have degree d 2 / 3 , and the vertices in P3 U P4 have degree d 1 / 3 . Add also edges between A 2 
and A3 so that every vertex in A 2 U A3 has degree d in G. Let / = A\ U A 2 , and suppose 
the degree ordering on A is Ai < A 2 < A3. When the algorithm is run, the vertices in A\ 
will be added to T3. Let a\ G A\ be the last vertex in the degree order in A\. After vertex 
a\ has been inspected by the algorithm, the graph P3 will be a 3-partite graph between 
classes A 2 , B3, B4, where the vertices in A 2 have degree d 2 ^ 3 and the vertices in B3 U P4 
have degree d 1 ^ 3 . Every vertex v G A 2 will be added to T 2 , and the graph P 2 will then be 
the required <i 1//3 -regular graph on B3 U B4. 

The graph P\ can be achieved similarly. This argument overlooks the point that the 
graphs Pi j P 2 , P3 cannot be simultaneously produced as stated; for example, it would 
require the degree in P3 of a vertex in b G B 2 to be d 2 ! 3 + d 1 / 3 + 1, whereas the maximum 
degree, by construction of P3, is d 2 ! 3 + 1. Nonetheless, we can adjust some of the edge sets 
slightly so that the example works. These constructions generalize to all r. 

8. List colourings 

In [37], a lower bound for the list colouring number of a regular hypergraph was proved. 
Theorem 2.1 of that paper, based on a simple probabilistic argument, gave a bound of 
approximately (logfc)/log(l/c) provided there is a collection C of containers for the inde- 
pendent sets, with |C| < (1 — c)n for each C G C and with \C\ < e n l k . This proof fails to 
work for a general hypergraph because it is not possible to find containers of bounded size, 
but Theorem 12.81 provides conditions under which we can recover the proof. 

Let [t] be a set of colours. We say that a collection of lists {L u C [t] : u G [n]} is 
C-compatible if there is a colouring function / : [n] — > [t] and a tuple (Ci, . . . , Ct) € C, such 
that, for each u G [n], f(u) G L u and u G C/( u )- 

Theorem 8.1. Let < e, c < 1. T/ien £/iere exists ko = fco(e, c), suc/i t/iat t/ie following 
property holds for all k > k^. 

Lett = |_(1 — e)logfc/log(l/c)J and lett = [2£ 2 /c\. Letn> k and letC C V\n\. Suppose 
that there is a map g : C* —> [k,n], such that 

1 * 

(a) _^|QnM| < (l-c)« 

1=1 

holds for every (C\, . . . , Ct) G C* ; where v = g(C\, . . . , Ct). Suppose moreover that 

(b) \{(C 1 n[v},...,C t n[v}) : g(C 1 ,...,C t ) = v}\ < e vt l k 

holds for all v G [n]. Then there is a collection of lists {L u : u G [n]}, each of size \L U \ = £, 
which is not C-compatible. 

Proof. For each u G [n], let L u G [t]^' be a subset of [t] of size £ chosen uniformly and 
independently at random, and let C = {L u : u G [n]} be the collection of lists. 
Given a tuple (Ci, . . . , Ct) G C*, we define, for each u G [n], the set of colours 

B u = B u (C 1 ,...,C t ) = {i£ [t]:ueCi}. 
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We say that C fits (Ci, . . . , Ct) if L u n B u ^ for each u G [n]. This is the same as saying 
there is a function / : [n] — > [t] with f(u) G L u n -B M , or in other words, /(«) G £ u and 
u G C/( u )- Hence we shall prove the theorem by showing that, with positive probability, C 
fits no tuple (Ci, . . . , Ct), since then C is not C-compatible. 

In fact, we claim something stronger: with positive probability, C rejects every tu- 
ple (Ci,...,Ct), meaning that there is some u G [t>] with L u n B u = 0, where u = 
g(C\, . . . , Ct). To establish the claim, fix for the time being some tuple (Ci, . . . , Ct) and 
let v = g(C\, . . . ,Ct). Let u G [v] and let p n be the probability that L u D B u = 0, or 
equivalently L u C [t] \ B u . Then 

p M = Pr(L u n S u = 0) = f M (f) where z u = max{£ - 1, t - |J3„|} . 



cA /t x : 



J J V, 

We note here that £ > 1 if ko is large enough and thus ct > £. Write z for the average of 
the values z u for it G [v]; then by condition (a) of the lemma we have 

t 

VZ= ^2 Z u >^t - \B U \ =vt-^2 \ B u\ = Vt- ^ \ C * n HI > vct • 

So, since the function ( 2 ^) is convex for z u > £ — 1, we have 

ue[v] ue[v] v 7 v 7 

Since f > 1 we have -l)/t < {£- l)/{2£ 2 /c - 1) < c/2€, and so (c - - 1)/^ > 
c/(l — l/2£) e > c e /2. Hence the probability that £ fails to reject (Ci, . . . , Ct) is 

Pr(5 u n L u / for all u G [«]) = JJ (1 - p u ) 

«e [<;] 

< exp{- ^ p M } < exp{-uc < /2} . 

u£[v] 

Notice that the probability of (Ci, . . . ,C\) not being rejected depends only on the tuple 
(CiPl [v], . . . ,CtH [u]) which, by condition (b) of the lemma, takes at most exp{vt/k} values 
as (Ci, . . . , Ct) ranges over C*. Hence if we fix u and write P v for the probability that there 
is some tuple (Ci, . . . , Ct) with v = </(Cl, . . . , Ct) which is not rejected, then 

P v < exp{vt/k - vc e /2} 

4 f( l-e)logfc y , , 
log 1/c 



< 




\v r 


exp < 


2fc 


< 




f 


exp< 




< 






exp < 


A v 


< 


v- 2 


if fc 



since k < v 
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Finally, if we consider all tuples (C\, ... ,Ct) G C*, the probability that one of them is 
not rejected is at most 

E p « ^ E^ 2 < 1 

ue[fe,n] v>k 

if fco is large. This establishes our claim and so proves the theorem. □ 

We can now prove a lower bound on the list chromatic number of a hypergraph. It 
is awkward to state the most general conditions under which a non-trivial bound can be 
obtained, so instead we state a couple of typical results. The first applies to many r-graphs 
of average degree d, and certainly to simple r-graphs; indeed, Theorem 1 1 . 1 1 stated earlier is 
the special case when G is simple. 

Corollary 8.2. Let r G N be fixed. Let G be an r-graph with average degree d. Suppose 
that Sfi(v) < f r every v e V(G) and for2<j <r (recall Definitional), 

where o(l) — > as d — > oo. Then 

Xi(G) > (l + o(l))^-^log r d. 

Moreover, if G is regular then 

Xi(G) > (l + (l))-^log r d. 

r — 1 

Proof. We shall apply Theorem 12.81 to G. Choose £ = £(<f) so that, as d — > oo, then 
( = 0(1), ( = and d {j \v) < dC'-i)/^- 1 ^- 1 for all v G V(G) and 2 < j < r. Let 
T = cT 1/(r_1) e~ 3 - Then ( < l/12r! because C = o(l). Also, recalling Definition [231 
5j = X) v S3){v)/T3- X nd < C 2 = o(C), so -5(G,r) < C- Moreover r < C 2 /r because C = d°^ . 

Let A; = LC 3 / rl og(i/V)J- Then log A: = (l/(r - 1) + o(l))logd Let u E V(G). For 
each u G V"(G), at most d^ 2 \v) < d( r_2 )/( r_1 )^ _1 edges contain both n and u, and so 
d(v) < nd^y^-^C 1 . It follows that n([k]) < {l/nd)knd^- 2 ^ ( - r -^ C 1 < C 5 < C/2H. So 
the conditions of Theorem 12.81 are satisfied. It follows that there exists a collection C of 
containers for the independent sets of G, satisfying properties (b) and (c) of Theorem 12. 8\ 
and since £ _2 rr log(l/r) < 1/A; it follows that conditions (a) and (b) of Theorem 18.11 are 
satisfied, with c = 1/r! - 8C > (1 + o(l))r"( r - 1 ). 

Consequently there are lists of size (1 + o(l)) log &/log(l/c) that are not C-compatible, 
which is to say lists of size at least (l/(r— l)+o(l)) log d/log(l/c) > (l/(r— l) 2 +o(l)) log r d. 
Since C is a set of containers for the independent sets of G, the first claim of the corollary 
follows. 

The proof for regular graphs is similar, except that we take c = 1/r + o(l). To achieve 
this we make use of Corollary 12.71 instead of Theorem 12.81 With r, £ and k defined as 
before, we can take e = ( in Corollary 12.71 because 5(G,t) = o(£). We obtain a collection 
C of containers such that e(G[C]) < C e (G) = o(e(G)) for all C G C. Because G is regular 
this means that \C\ < (1 — c)n, where c = 1/r + o(l) and n = \G\. We can now apply 
Theorem 18.11 by defining g(C\, . . . , Ct) = n for all {C\, . . . , Ct); note that condition (b) of 
the theorem is satisfied because, by Corollary 12.71 log |C| < c(r) log(l/e)nrlog(l/r) < n/k. 
The remainder of the proof is the same. □ 

The bound given for r-graphs of average degree d is weaker than that for regular r-graphs 
because we only had containers of measure 1 — 1/r! available, rather than 1 — 1/r. Probably 



HYPERGRAPH CONTAINERS 



29 



this is an artifact of our algorithm, and Xi{G) > (l/(r — 1) + o(l)) log r d holds for r-graphs 
of average degree d. 

The bound for regular graphs is tight. Indeed, let K(r, m) be the complete r-partite 
r-graph with m vertices in each class. Suppose that lists of size I are given to the vertices. 
Randomly choose, for each colour in the palette, a vertex class on which that colour is 
forbidden to be used; then the expected number of vertices with no available colour is 
rmr~ e which is less than one if I > 1 + log r m, and so xi < 2 + log r m (see Haxell and 
Verstraete [23]). This graph is d-regular where d = m r ~ 1 so xi < 2 + (l/(r — l))log r <i. 
Note that d^\v) = m r - j = d&'-M'- 1 ). 

It is not hard to construct an m-regular simple subgraph G of K(r,m), and so (putting 
d = m) we have simple d-regular r-graphs with xi < 2 + log r d. Quite possibly xi < 
2 + (l/(r — l))log r <i in this case too, because a subgraph of G with d l ~ l ^ r ~ l "> vertices 
in each class is likely to be very sparse, and a random colouring might be repairable if 
rdr~ £ < d 1-1 '^ -1 ), or £ > 1 + (l/(r — 1)) log r d. But this argument is far from rigorous. 

As an illustration of the use of containers for non-independent sets we finish with the 
next result. 

Corollary 8.3. Let G be a graph with average degree d. Then, for each u E V(G) there is 
a list L u of (l + o(l)) log 2 d colours, such that it is not possible to choose a colour c(u) G L u 
with the vertices of each colour spanning a planar graph. 

Proof. We follow the proof of Corollary 18.21 with r = 2, except we use a set C of containers 
for those subsets / for which G[I] is planar. Since a planar graph is 5-degenerate, we can 
apply Theorem 1 2 . 8 1 and continue with the proof exactly as before, provided 5 < rdQ/r. But 
r = d -1 £ -3 so this condition holds comfortably. □ 

9. //-FREE GRAPHS 

Theorem 11.31 is obtained by a routine application of Corollary 12.71 to the following hy- 
pergraph, whose independent sets correspond to H-fiee ^-graphs on vertex set [N]. 

Definition 9.1. Let H be an ^-graph. Let r = e{H). The r-graph G(N,H) has vertex set 
[N]W, where B = { Vl ,...,v r } G V(G)^ is an edge whenever B, considered as an £-graph 
with vertices in [N], is isomorphic to H. 

Lemma 9.2. Let H be an i-graph with r = e(H) > 2 and let 7 < 1. For N sufficiently 
large, S(G(N, H),^ 1 iV- 1 /™^)) < r2 r2 v(H)\ 2 -/. 

Proof Let G = G(N, H). Consider a C [N]^ ( so a is both a set of vertices of G and an 
^-graph on vertex set [N]). The degree of a in G is the number of ways of extending a 
to an £-graph isomorphic to H. If a as an I- graph is not isomorphic to any subgraph of 
H, then clearly d(a) = 0. Otherwise, let v(a) be the number of vertices in a considered 
as an £-graph, so there exists V C [N], \V\ = v(o~) with a C V^'. Edges of G containing 
a correspond to copies of H in [iV]w containing a, each such copy given by a choice of 
v(H) — v(o~) vertices in [N] — V and a permutation of the vertices of H. Hence d(a) = 
Ca (v{H)-v(a)) f° r some integer c a in the range 1 < c a < v(H)\. Thus for N sufficiently 
large, 

l/v(H)\ < d(o-)N- v W +v ^ < v{H)\. 
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For v € V(G) and 1 < j < e(H), the quantity d^\v) is the maximum of d(a) over all 
a C [JV]W with v G a and \a\ = j. Thus 

l/v{Hy.<d (j) {v)N- v(H)+fU) <v{Hy., where f{j)= min 

H'CH,e(H')=j 

Let r = 7 - 1 AT- 1 /^W. Since f(l) = £ and 7 < 1, for 2 < j < e(JET) 

a. = &j^M- < vWW-iNt-M < v ( H) ? N e-f(J)+(j-i)/™W 

By definition of f(j) and m(H), £ — f(J) + (j — l)/m(H) < 0. Hence 5-,- < v(H)] 2 j and so, 
with r = e(H), 

r 

S(G,t) =2^)- 1 ^2-^ 1 )5 j < r2 r ^(F)! 2 7 

as claimed. □ 

We use a well-known supersaturation theorem to bound the number of edges in contain- 
ers. 

Proposition 9.3 (Erdos and Simonovits [20]). Let H be an £-graph and let e > 0. There 
exists Nq and n > such that if C is an £-graph on N > Nq vertices containing at most 
rjN< H ) CO pi es f r then e(C) < (tt(H) + e)(^). 

Proof of Theorem \1.3[ Let n be given by Proposition 19.31 let /3 = min{e, n}, let G = 
G(N,H), let r = e(H) and let r = I2r\r2 r2 v(H)! 2 N- 1 /™^ / (3. By Lemma 19^1 S(G,t) < 
(3/12r\. The vertex set of G is [N]^\ n = (^); and the edge set of G is the set of f-graphs 
isomorphic to H, of which there are at most v(H)\( v IJj^) . We claim that the collection C 
given by Corollary 12.71 applied with ft for N sufficiently large satisfies the conditions of the 
theorem. Write d for the constant c = c(r) in Corollary 12.71 

Condition (a): C covers the independent sets of G, which are precisely the H-iree ^-graphs 
on vertex set [N]. 

Condition (b): for each /, the corresponding C E C with / C C is specified by a tuple 
T = (Ti, . . . , T s ) G V s (I) with s bounded by a constant depending on r and e. Furthermore 
\Ti\ < dm < dV^-VmW for c sufficiently large function of H, e. 

Condition (c): the number of copies of H in C is at most /3e(G) < f3N v(H \ Note also 
that since (3 < n, Proposition 19.31 implies that e(C) < (tt(H) +e)( i ). 

Condition (d): this holds as log |C| < d log(l//3)nrlog(l/r) < ciV^ 1 /" 1 ^ log iV for c 
and N sufficiently large. □ 



Proof of Corollary \1.4\ We may assume that e(H) > 2, since otherwise the result is trivial. 
Let e > and let C C V([N]^) be the collection given by Theorem 11.31 (we may assume 
that N is sufficiently large). Note that \C\ = 2°^ and e(C) < (ir(H) + e)(^) for each 
C £ C. Since every H-free £- graph on N vertices is a subgraph of some graph C £ C, this 
means there are at most X^cec < 2 iy ' n<yH ^ + ' L+oiy1 ^^ H-free graphs on N vertices. Since 
e > was arbitrary this completes the proof. □ 



Proof of Theorem \1.5\ The proof is very similar to Theorem 11.31 an d we only sketch the 
details. 
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Let r = )• Let G be the r-graph whose vertex set is two copies of [N]V>, denoted 
by Vr and Vr (vertices in Vr correspond to hedges and vertices in Vb correspond to non- 
£-edges), and whose edges correspond to copies of H; thus / G (Vr U Vr)^ is an edge of 
G whenever / n Vr and / n Vr are the edges and non-edges, respectively, of an ^-graph 
isomorphic to H with vertices in [N]. Note that every induced- H- free £-graph I C [N]^ 
corresponds to an independent set of G, namely the set of vertices in Vr corresponding to 
the edges of I together with the set of vertices in Vr corresponding to non-edges of I. 

G has very similar properties to G' = G(N, K^) H -. ) given in Definition 19.11 where ^jfm 
is the complete £-graph on v(H) vertices. In particular, for fixed r, the 5j for G differ only 
by a constant factor by those for G' . Let m = m(K^ H ^) = (j^^) — lj /(v{H) — £), so 

that 5(G,N- x l m /e) = O(e). 

As in the proof of Theorem 11.31 we may apply Corollary O with r = 0(N- l / m /e) 
to obtain the required collection C C V(Vr U Vr), where each C G C is identified with a 
2-coloured ^-multigraph in the obvious way. □ 

We now prove Theorem 11.61 First we must define the function h p (H). For a 2-coloured 
£-multigraph J with red and blue edge sets Jr and Jr, define 

h p(J) = ~\ J R ~ j b\ log 2 p- \ Jb ~ Jr\ log 2 (l ~P)- 

Observe that the probability that a random £-graph on the same vertex set is a subgraph 
of J is 2~ H vi J ). Let 

hex(#, N) = min{ fl" p (J) : Jr U Jr = [iV] W , £T ^ J } . 

Then we put h p (H) = limjv^oo hex(H, N)(f%) 

As always, we need a supersaturation lemma for the proof of Theorem 11.61 The proof of 
the following is essentially the same as that of Proposition 19. 3\ and we do not give details. 

Lemma 9.4. Let H be an l-graph and let < e,p < 1. There exists Nq and 77 > such 
that if C is a 2-coloured l-multigraph on N > Nq vertices containing at most rjN v ( H ^ copies 
ofH then H P (C) > (h p (H) - e)( N £ ). 

Proof of Theorem \1.6\ The lower bound on the stated probability follows from the defi- 
nition of h p (H). Let e > and let r be given by Lemma 19.41 Let C be the collection 

of 2-coloured ^-multigraphs given by Theorem 11.51 satisfying \C\ = and for every 

C G C, the number of copies of H in C is at most i]N v ( H > . By Lemma 19.41 for each C G C 
we have H P (C) > (h p (H) — e)(^)- Since every induced- U-free graph on vertex set [N] is 
contained in some C G C, 

¥(G i£) (N,p) is induced-#-free) < J2^ Hp{C) < 2-(M H )- £ +°( 1 ))(?) . 

Since e > was arbitrary this completes the proof of Theorem 11.61 □ 

10. Linear equations 

In this section, F denotes a finite field, the set of integers [N], or an abelian group. 
We consider linear systems of the form Ax = b, where A is a k x r matrix. As in the 
introduction, when J 1 is a finite field or [N], then A has entries in F. Our methods apply 
to abelian groups as well: in this case, let A have integer entries, where integer-group 
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multiplication ax, a G Z, x G F, is a copies of x, x + • • • + cc; or —a copies of — x if a is 
negative. The definitions of full rank and abundant given in the introduction extend to 
abelian groups. 

Often one wishes to discount solutions to an equation Ax = b where the vector x contains 
repeated values. For example, in forbidding a 3-term arithmetic progression, we take 
A = (1,1,-2) and b = (0) and discount solutions of the form x + x — 2x = 0. To 
accommodate this setup, we let Z C F r be a set of discounted solutions. We also call 
(F, A,b,Z) a k x r linear system, where a solution to the system is a vector x 6 F r — Z 
such that Ax = b. A subset / C F is solution-free if there is no x G I r — Z with Ax = b. 

In this setup, we could also define ex(F, A, b, Z) to be the maximum size of a solution- 
free subset when Z ^ 0. However, this turns out to be unnecessary, since typically \Z\ = 
o(\F\ r ~ k ) so ex(F, A, b, Z) = ex(F, A,b) + o(\F\) by Proposition HTT71 

Definition 10.1. Let (F, A,b,Z) be a k x r linear system with A abundant. When F is a 
finite field or [N], define (as in Rodl and Rucihksi |40j ) 

,A\ I- 7 !- 1 
rnpfyl) = max — - - -, 

Jc[r],\J\>2 | J| - 1 + rank(Aj) - k 

where the matrix Aj is the k x (r — | J|) submatrix of A obtained by deleting columns 
indexed by J. Otherwise, when F is an abelian group, let t be the maximum value of j for 
which A j has full rank whenever \ J\ = j, and let 

(A\ k + t-1 
m F {A) = — - . 

It can readily be checked that if A is abundant then the denominators appearing in the 
definition of rriF{A) are strictly positive. The separate definition of mp(A) when F is an 
abelian group is necessary since the rank of an integer matrix over an abelian group is not 
well-defined; in general, when the pair (F, A) could either be considered a finite field or an 
abelian group with A integer valued, the value of the second definition is at least as big as 
the value of the first definition. This is since rank(^4j) = k when \J\ <t, and is otherwise 
at least max{0, k + t — | J|}. 

This is our main theorem for linear equations. The determinantal of a k x r integer 
matrix is the greatest common divisor of the determinants of its k x k submatrices. 

Theorem 10.2. Let e > and let (F, A,b, Z) be a k x r linear system with A abundant. 
There are constants c,/3, depending on A,e in the case F = [N], and depending only on 
k,r,e otherwise, such that if \F\ > c and \Z\ < j3\F\ r ~ k then there exists C C V(F) 
satisfying 

(a) for every solution-free subset I C F there exists C G C with I C C , 

(b) moreover, for each pair I,C in (a), there exists T = (T\, . . . ,T S ) where Ti C /, 
s<c and J2i \ T i\ < c\F\ l - l / mF ^ A \ such that C = C(T), 

(c) for every C £ C, the number of solutions to Ax = b with x G C r — Z is at most 
e\F\ r ~ k , 

(d) if F is a finite field or [N] , or F is an abelian group and the determinantal of A is 
coprime to \F\, then \C\ < ex(F,A,b) + e\F\, 

(e) log |C| < clF^-V^WloglF]. 

Theorem 10.3. Let (F, A,b, Z) be a k x r linear system with \Z\ = o(\F\ r ~ k ) and A 
abundant, and additionally with the determinantal of A coprime to \F\ in the case that F 
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is an abelian group. Then the number of solution- free subsets of F is 2 ex ( F,A ' b ) + °(l F l). Here 
o(l) — > as \F\ — > 00, with A fixed in the case F = [N]. 

Note that Theorem 110.31 generalizes Theorem [L9] by including abelian groups and allow- 
ing Z ^ 0. 

If A is not abundant, then the conclusion of Theorem 110.31 need not hold. For example, 
let A = (1, 1), b = (0), and consider the cyclic group C n for n odd. Observe that the pairs 
(x,y) such that x + y = and x / y partition C n \{0}. Therefore ex(C n ,A, b) = (n + l)/2. 
However, one can construct a solution- free set by including either x or y or neither for each 
pair (x,y), so there are at least ^ n ~ 1 )/ 2 solution-free sets. There are similar examples with 
larger values of k and r > k + 2. 

Additionally, when F = [N], the condition that A is fixed as \F\ = N — > 00 is necessary. 
For example, for the equation w + x + (10iV)y — (10iV)z = N, the maximum size of a 
solution- free subset of [iV] is N/2 (since for every pair w, x G [N] with w + x = N, a 
solution- free set can include at most one of w or 1), but there are at least 3^ N_1 ^ 2 solution 
free sets, since for every w,x G [N] with w + x = N and w 7^ x, we can include either w 
or x or neither to form a solution- free set. 

Theorem 110.21 follows from an application of Corollary 12.71 to the following hypergraph, 
whose independent sets correspond to solution-free subsets of F. 

Definition 10.4. Let (F, A,b,Z) be a k x r linear system. The r-partite r-graph G = 
G(F, A, b, Z) has vertex set V{G) = X\ U • • ■ U X r , where each Xi is a disjoint copy of F, 
and edge set E(G) = {x = (x±, . . . , x r ) 6 X\ X • • • X X r — Z : Ax = b}. 

Fact 10.5. Let F be a finite field or abelian group, let A be a k x I matrix and let b G F k . 
If A has full rank then there are solutions to Ax = b. More generally if F is a finite 

field, there are at most |^?|^- rank ( A ) solutions to Ax = b. 

Proof. If A has full rank, then for every 61, 62 £ F k there exists with Ax = 62 — b\. 

Thus if x\ is a solution to Ax\ = b\ then A(x\ + x) = 62, so by symmetry every b E F k has 
/\F\ solutions to Ax = b. The case when F is a finite field is standard. □ 

Lemma 10.6. Let (F, A,b,Z) be a k x r linear system where F is a finite field or abelian 
group, A is an abundant matrix and \Z\ < \F\ r ~ k /2. Let G = G(F, A,b, Z), 7 < 1 and 
T = |jp|-i/"»fW/ 7 . Then 5{G,t) < r2 r2 j. 

Proof. The number of edges in G is the number of solutions to Ax = b not in Z. The matrix 
A has full rank, so by Fact 110.51 the number of edges of G is |F| r_fc — \Z\ > \F\ r ~ k /2. 

For v € V{G) and j > 2, d^\v) is the maximum over all sets a = {y\, . . . , yj} with v G a 
of the number of edges of G containing a. If a contains two vertices in the same part X{ 
then there are no edges containing o. Otherwise suppose yt G for £ = 1, . . . , j, where 

1 < ii < • • • < ij < r. Let J = . . . ,ij}. The number of edges containing a is at most 
the number of solutions to Ax = b with x\ t = yi for I = 1, . . . ,j, which for some b* G F k 
is the number of solutions to Ajx* = b* , x* G F r ~° . We now split the proof into two cases 
depending on whether F is a finite field or an abelian group. 

When F is an abelian group: If j < t (recall Definition 110. ip , then A j has full rank by 
assumption, and so Fact 110.51 implies the number of solutions is at most \F\ r ~i~ k . When 

2 < j < t, we have d^\v) < \F\ r -i~ k , so 

V d^(v) 2\F\ r ~i~ k , ■ , ■ 
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t T l—t—k 



When t + l<j<t + k, the bound $i\v) < d®(v) implies that 

9l Z?\r—t—k 

63 < \ L z 1 = 2|F| 1 -V 1 ^ < 2IFI 1 

J rr-J — lip 1 ' K — 1 11 11 

When t + A; + l<j<r, the bound d^\v) < \F\ r ~i implies that 

9 1 TP\ r ~ 3 

§ < . y J - = 2|F| fc ^+ 1 r 1 ^. 
j — 7~j — 1 F\ r — — 

Since r = 7 -i|i?|-V™F(A) = ^-1 and 7 < 1, this implies that 6j < 2j for 

all j. 

When F is a finite field: By Fact 110.51 the number of solutions to A jx* = b* is at most 
| F | 7 ._ i - ra nk(A J )_ Hence 

d&(v) < max \ F \r-j-™nk(Aj) ^ 
JC[r],\J\=j 

Using r = ^~ l \F\~ 1 / mF ^ and 7 < 1, this implies that 

S-= <o max |p|l-i+fc-rank(Aj)+(j-l)/m P (A) 7 _ 



The exponent is at most be definition of 7jif(A) so <5,- < 27. 

In both cases, S(G, r) = 2(5) -1 ^=2 2 ^^ 5 j < ^7- □ 

The proof of part (d) of Theorem I1U.2I requires the following removal lemma of Krai', 
Serra and Venna. Note that if A has determinantal coprime to \F\ then in particular A has 
full rank. 

Proposition 10.7 (Krai', Serra and Vena |31|. 132]). Let (F,A,b) be a k x r linear system 
where F is a finite field or abelian group and A has full rank. Suppose further that A has 
determinantal coprime to \F\ in the case that F is an abelian group. Let a > 0. Then there 
exists T] = r](a, k,r) > such that for all Y±, . . . ,Y r C F and b G F k , if there are at most 
rj\F\ r ~ k solutions to Ax = b with xi G Yi, then there are sets Y[ C Yi, ... ,1^,' C Y r with 
\Yj\ < a\F\ such that there is no solution to Ax = b with Xi £ Yi\Y( . 

Proof of Theorem \1 0.21 We may assume that F is a finite field or abelian group. Indeed, 
[N] can be embedded into the finite field Z p for a sufficiently large prime p. Taking p in 
the range Ak\\A\ k N < p < 8fe!|^| fc iV, where \A\ is the sum of the absolute values of A, 
guarantees that A is still abundant in Z p and that a solution to Ax = b (mod p) is also a 
solution to Ax = b (provided, say, \bi\ < p/2; but we may assume this since otherwise there 
are no solutions to Ax = b in [N]). Then the result of this theorem for (Z p , A, b, Z) implies 
the result for ([N], A, b, Z), since p/N is bounded by a constant depending only on A. 

Let r] be given by Proposition 110.71 applied with a = e/r. Let j3 = min{e, r//2, 1/12H}. 
Note that j3 = (3(e,k,r). 

Let G = G(F,A,b,Z) and let r = 12r!r2 r2 \F\~ 1 / mF ^ / fi so that by Lemma US 
5{G,t) < (3/12r\. Note also that r < l/144r! 2 r for |F| sufficiently large, so Corol- 
lary 12.71 gives a collection of sets T> covering the independent sets of G. For D £ D, 
let TTi(D) = D n Xi C F be the part of D in the ith copy of F and let 

C = {C{D) :D€V}C V(F) where C{D) = tti(D) f~l • • • f~l ir r (D). 

We claim that C satisfies the conditions of the theorem. 
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Condition (a): consider a solution-free set I C F. The subset J of V(G) formed by 
taking a copy of I in each is an independent set in G. In particular, it is contained in 
some D £ T>, hence / C C(D). 

Condition (b): for I,C in (a), there exists a tuple T = (T%, . . . ,T S ) such that T, L C J, 
D = D(T), \Ti\ < 288r! 2 rn < c\F\ 1 ~ 1 / mF ( A ) and s < c for c sufficiently large as a function 
of r and /3. Thus C is specified by the rs-tuple (Ti D Xi, . . . , T x D X r , . . . , T s n X r ) C I rs . 
This verifies condition (b) (with rs in place of s). 

Condition (c): consider C 6 C. Each solution to = b with x £ C r — Z corresponds 
to an edge of of which there are at most f3e(G) = j3\F\ r ~ k . 

Condition (d): there are at most /3\F\ r ~ k + \Z\ < rj\F\ r ~ k such x € C r with Ax = b, so 
by Proposition 110.71 there exists a set Y = Ujl^ C C, |Y| < e|-F| such that there are no 
x e (C- y) r with Ax = 6, implying that |C| < ex(F, A, 6) + e|F|. 

Condition (e): observe that 

log |C| < log |2>| < c(r)log(l//3)|F|rlog(l/r) < c |F| 1_1 / m ' , ( A ) log 
for c sufficiently large as a function of r and /3. □ 

Proof of Theorem \10.3[ The proof is the same as the proof of Corollary II, 44 where the 
collection given by Theorem 1 10. 2 1 is used instead of the collection given by Theorem II .31 □ 

11. SlDON SETS 

In this section we prove Theorem 11.101 To prove the upper bound we construct, in 
the natural way, the hypergraph representing the solutions to w + x = y + z in a subset 
Sc. [n]. We then apply Theorem 15.31 in an entirely mechanical way; all that is needed is to 
set appropriate values and to check the conditions. We remark that the codegree function 
5 for this hypergraph exhibits a change in behaviour when \S\ < n 2//3 , as the dominant 
contribution then comes from 62 rather than £4 (see equation ([6])); Kohayakawa, Lee, Rodl 
and Samotij [29J noticed an interesting behavioural change at the same point, for a closely 
related problem. 

Proof of Theorem \1.1Q[ Lower bound construction. Suppose n = Ap(p—1) for some prime p. 
Ruzsa [H] shows that there is a set S C [p(p — 1)] of size p — 1 such that every sum of 
two elements of S is distinct modulo p(p — 1). Thus for any U\,U2,Uz,U± C S satisfying 
Ui n Uj = for i / j, the set 

U x U (U 2 + pip - 1)) U (U 3 + 2p(p - 1)) U (U A + 3p(p - 1)) 

is a Sidon subset of [4p(p — 1)], where V + x := {v + x : v £ V}. This gives 5 P_1 = 
^(i+o(i))v^ > 2(1 .i 6+0 (i))VH g idon su b S ets of [n] = [4p(p- 1)]. The general case follows by 
embedding [4p(p— 1)] into [n], where p is the largest prime such that 4p(p—l) < n, and using 
the fact that the ratio of successive primes tends to 1. (We note that any construction for 
large modular Sidon sets could have been used here; this includes the classical constructions 
of Singer [50] and of Bose [9].) 

Upper bound. Let G be the 4-graph on vertex set [n], where {w,x,y, z} £ [n]^ is an 
edge whenever w + x = y + z. Sidon sets correspond to independent sets in G (although 
the converse is not always true, since solutions to w + x = y + z where w = x or y = z 
do not correspond to edges of G). We shall apply Theorem 15.31 to the graph G; our task is 
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to bound f(m). To this end, let /3 = 3 X 10 14 , let uq = f3y/n, and consider U C [n] where 
u = \U\ > ^o- 

For ie[n- 1], let U = \{{x,y} G t/ (2) : x < y,y - x = i}\. Note that = Q). Each 

pair of sets {w, z} / {y, x} with w — z = y — x corresponds to an edge with w + x = y + z, 
and each such edge corresponds to the two pairs {w, z} ^ {y, x} and {w, y} ^ {x, z}. Hence 
the number of edges in G[U] satisfies 



m = e(G[U]) = \ g > (-1 p ^ > n 4 /2Qn) 

where the last inequality holds for u > uq. Let eo = /3 4 n/20, as used for Theorem 
Thus m > eo- Let fc = 12r! = 288. For the application of Theorem 15.31 put 

T = t{U) = max{24£ru 2 /m, (4ku/m) 1 ^ 3 }. 



We must check that the conditions of Theorem 15.31 hold. 

Recall the definition of d^\w). In G[U], observe that S 2 '(w) < u/2 + u = 3u/2, since 
for x 6 U, the number of solutions of the form w + x = y + z is at most u/2 and the number 
of solutions of the form u; + y = x + 2:isat most u; similarly d^ 3 \w) < 3 and d^(w) < 1. 
Hence 



sr/n 4r 2 m 4r 3 m 



and (since r < 1/12) 

12?/ 2 27/ 

(6) 5 = 328 2 + 165 3 + 4<5 4 < + -«— . 

™ rm 

Then both terms on the right hand side of ([6]) are less than 1/2/c, so 5 < l/12r! is satisfied. 
To apply Theorem 15. 3\ we also require r < l/144r! 2 r = 1/331776. 

If r < 2<iku 2 /m, then the constraint r < l/144r! 2 r is automatically satisfied (since 
m > u 4 /20n and u > /3\/n), and 

urlog(l/r) < (24/cu 3 /m)log(m/(24A;n 2 )) 

n\i/4 (m/n) 1 / 4 



< 20 3/4 48A;v / 7T( — I log 



.mJ 6 (24/t) 1 /2(20)i/4 
=: fi(m), 

where the first inequality holds since r log(l/T) is an increasing function of t when r < 1/e, 
and the second inequality holds since u \og{m/{2Aku 2 )) is an increasing function of u when 
u < e~ 1 / 3 y / m/24/c, and u < (20nm) 1//4 which is less than e -1 / 3 y / m/2Ak because m > eo- 
Alternatively, if r < (4/cii/m) 1//3 then the constraint r < l/144r! 2 r is automatically 
satisfied, and 

urlog(l/r) < (4/cu 4 /27m) 1/3 log(m/4/eu) 
< 6A; 1/3 n 1/3 logn 
=: f2(jn) when m < e(G), 

where the second inequality holds since u 4//3 log(m/4/c?j) is an increasing function of u when 
u < e~ 3 / 4 m/4A; and u < (20nm) 1//4 , together with the bound m < n 4 . Let /^(m) = for 
m > e(G). 
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Let a = 1 — l/2r! and mi = eo/a 1 = /3 4 ra/20a\ The conditions of Theorem 15.31 hold, 
so let C be the collection of containers given by Theorem 15.31 for the graph G, where each 
CgC satisfies e(C) < eo (and hence |C| < no), and log \C\ < 288rr! 2 ^7>o f( m i)- Since /i 
and /2 are non-increasing functions of m, f(m) < max{/i(m),/2(m)} for m > eo- 

Note that Ei>o 7* = 1/(1 - 7) and Ei>o = 7/(1 - 7) 2 , so 

288rr! 2 ^/i(m,) = 288rr! 2 20 3 / 4 48fc v / ^ ( 20a *) V4 log 



-98S , 2 960fc^ / aVMogq/q) log(/3/V480fc) \ 
/3 ^ 4(1 - aV4)2 + i _ a i/4 J 

2 ' 

Observe that rrii > n 4 > e(G) when i > 3 log nj log(l/a) (and hence fii^i) = 0); s ° 

Each Sidon set in [n] is a subset of size at most (1 + o(l))y/n of some C £ C, \C\ < uq. 
By Lemma [5. II the number of such subsets is at most exp{#uo(l + log(l/0))}, where 9uq is 
the maximum size of a Sidon set, so 9 = 1/0 + o(l). Letting 5 be the collection of Sidon 
subsets of [n], 

log | S | 0uo(l + log(l/0)) , 288rr! 2 ^ 288rr! 2 ^ 



v v i>0 v i>0 

< l+log/3 + 7/2 + o(l) < 551og2 + o(l), 

which completes the verification. □ 



12. Sparsity 

In this section we prove Theorem 11.111 and related theorems. Note that the condition 
p > cN~ l / m ( H ^ in Theorem II. Ill is tight up to the value of c. Indeed, if p = o(A r_1 / m ^), 
it is readily checked that for some subgraph H ' C H with m(H') = m(H), the expected 
number of copies of H' is much less than the number of edges, and removing very few edges 
will result in an H-free subgraph. 

As a further illustration of the paradigm described in ^1.61 we prove two other conjectures 
of Kohayakawa, Luczak and Rodl [30J. The first of these has already been proved, by Conlon 
and Gowers |12| for strictly balanced graphs and by Samotij [22], following Schacht [48] . 
for all graphs. It states that, for non-bipartite H, not only does every H-free subgraph 
/ of a random graph have at most (1 + o(l))pTr(H)( 2 ) edges, but in the case that / has 
close to pn(H)(J%^ edges, it can be made (x(H) ~ l)-partite by removing a small number 
of edges. The dense (p = 1) version of this theorem is the stability theorem of Erdos and 
Simonovits HHH51E2|. 

Theorem 12.1. Let H be a 2-graph with tt(H) > and let < 7 < 1. There exist 
constants e, c > such that for N sufficiently large and for p > cN~ l l m ( H \ the following 
is true. Let Eq be the event that there exists an H-free subgraph L C G(N,p) with e(J) > 
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(1 — x (h)-i ~ e )p(^) w hich cannot be made (x{H) — \)-partite by removing at most 7p( 2 ) 
edges. Then F(E ) < exp{-e 2 p(^)}. 

The other conjecture from [30] . sometimes known as the KLR conjecture, has a more 
technical statement. Let G be a graph. For U, W C V(G), write Eq(U, W) C E(G) for the 
set of edges of G with one vertex in U and one vertex in W. Let ec(U, W) = \Eq(U, W)\ 
and write dc(U,W) = ec(U,W) / (\U\\W\) for the edge density. For < rj,p < 1, say that 
the pair (U,W) is (r],p) -regular if for every U' C U with |J7'| > ry|C/| and W C W with 
|W'| > fy|W|, the edge density satisfies 

\d G (U' ,W) - d G (U,W)\ < VP . 

This extends the notion of regularity to sparse graphs of density p. 

Let H be a graph on vertex set [h]. In what follows, V\ U • • • U Vh is a partition of 
[N] = [hn], where each part has size \Vi\ = n. If G is a graph on vertex set [N], say that G 
is (-ff, 7/,p)-regular if for every pair (Vi, Vj) with {i, j} G E(H), the bipartite subgraph of G 
between Vi and ^ is (ry,p)-regular. 

Let G = G(n, M, H) denote a graph chosen uniformly at random from all /i-partite 
graphs with parts V\, . . . ,Vh, having ea(Vi,Vj) = M if {i,j} G E(H) and ea(Vi,Vj) = 
otherwise. We say that G(n, M, H) is H-free if there is no set of vertices vx,...,Vf l with 
Vi G such that {vi,vj} is an edge of G whenever {i,j} G E(H). 

A proof of the next theorem in the case that H is balanced was given by Balogh, Morris 
and Samotij [6]. The theorem verifies the KLR conjecture. 

Theorem 12.2. Let H be a graph and let a > 0. Then there exist r/, c > 0, such that for n 
sufficiently large, if M > cn 2 ~ x l m ^\ then 

F(G(n, M, H) is both H-free and (H,rj, M/n 2 ) -regular) < a M . 

Thus Theorem 112.21 savs that although there might be sparse (H, r/,p)-regular graphs 
with no copy of H (unlike in the dense case), there are extremely few of them. 

A similar result to Theorem 11.111 applies to other structures, such as solutions to linear 
equations as in CCOl So we can say, for example, that if J is a solution-free subset of F of 
maximum size, and if a subset X C F is chosen with probability p, then almost certainly 
the maximum size of a solution-free subset of X is p(\ J\/\F\ + o(l))|F|, provided p is not 
too small. 

Theorem 12.3. Let < 7 < 1 and let (F, A,b,Z) be a kxr linear system with A abundant, 
and additionally with the determinantal of A coprime to F in the case that F is an abelian 
group. Then there exist constants e,c> 0, depending on 7, A when F = [N], and depending 
only on "f,k,r otherwise, such that for \F\ > c, \Z\ < e\F\ r ~ k and p > c\F\~ 1 ^ rriF ^ , if 
X C F is a random subset with each element included independently with probability p, 
then the following event holds with probability greater than 1 — exp{— 7 3 p|i ? |/512}; 

every solution-free subset has at most p(ex(F, A,b) +7^1) elements. 

When F is a finite field, the condition p > c\F\~ 1 ^ rriF ^ in Theorem 112.31 is known to 
be tight up to the value of the constant c appearing, at least under some slightly more 
restrictive assumptions on (F, A,b, Z). See Rodl and Rucihski |4U| . 

Theorems 11.111 112.11 and 112.31 follow by applying the following lemma to the collections 
given by Theorems 11.31 and 110.21 
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Lemma 12.4. Given < v < 1 and s > 1, i/iere is a constant <f) = (j)(v,s) such that the 
following holds. Let M be a set, \M\ = n, and let X C V{M\ Let t > 1, let cpt/n < p < 1 
and let vnj2 < d < n. Suppose for each I El there exists both Tj = (T\, . . . ,T S >) £ V s (I) 
and D = D(Tj) C M, where s' < s, £\ |Xj| < t and \D(Tj)\ < d. Let X C M be a random 
subset where each element is chosen independently with probability p. Then 

(7) F (\D(T) n X\ > (1 + u)pd for some I C X, I el) < exp{-i/ 2 pd/32}. 

Proof. Consider I € I and T = Tj = (Xi, . . . ,T 8 >). Let J(X) = T x U • • • U T s >. Let E T be 
the event that 

J(T) C X and \D(T) n X| > (1 + i/)pd. 

The event Et is contained in Xt Pi Gt, where Ft is the event that J(T) C X and Gt is the 
event that \(D(T) - J(T)) D X| > (1 + v)pd - \J(T)\. Since X T and G T are independent, 
P(-Er) < P(X T )P(G T ). Now |J(T)| < t < pn/(j) < 2pd/<j)V < vpd/2 if <j> is large. So by 
standard estimates (e.g., Chernoff's bound), 

P(Gt) ^ P(Bin(d,p) > (1 + i//2)pd) < exp{-v 2 pd/16}, 

where Bin(n,p) is the binomial random variable. Note that P(-Pr) = p'^ 7 ^'. Let x = 
pn/t > (j), so t < 2pd/xu. If is large we may assume p(n — i) > t, so 

J>(X T ) < £ ( n Vy < (t + 1) (V^LP\ < ( xe 2 2 «)* < (xe 2 2 s )^ < exp^V/32} 
r i=o V * / 

holds if and therefore x, is large. If there exists L <Z X , L £ I with \D(Tj)DX\ > (l+5)pd, 
then the event Er I holds. Hence the probability in ([7]) is bounded by 

^>(F T )P(Gr) < exp{i/V/32} exp{-z/ 2 pd/16} < exp{-v 2 pd/32} 

T 

as claimed. □ 

Proof of Theorem \1.11[ Let X be the set of H- free ^-graphs on vertex set [N] . Let e = 7/4 
and M = [N]W . For I e X, let X = Xj, C = C(X) and c' = c(ff, e) be given by TheoremO 
Our aim is to apply Lemma 112.41 with D(T) = C(T) and 

i/ = 7 /2, d = (7r(J3T) + e) s = c', t = c'N £ - 1 ^ H \ 

The conditions of Lemma 112.41 then hold with n = (ft), noting that d > vn/2 and that 
p > cN~ l l m<yH } > (pt/n if c is large enough. Finally, note that in ([7|), each H-iree ^-graph 
I G X is contained in C(Tj) and (1 + v)pd < (ir(H) + j)p(f), so the probability in the 
statement of the theorem is bounded by 

exp{-i/ 2 pd/32} < exp /— ^p( \ /512 



J. 

completing the proof. □ 

Proof of Theorem UETi Notice that tt(H) = 1 - l/(x(H) - 1) and x(H) > 3. It is a 
standard exercise, using either the stability arguments of Erdos and Simonovits or using 
Szemeredi's regularity lemma, that there exists e > such that if C is a 2-graph on vertex 
set [N] for N sufficiently large with e(C) > (1 — x ^_i — He) (^) and such that C contains 

at most e[ v rff\) copies of H, then there exists a subgraph F C C of size e(F) < (7/2)( 2 ) 
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such that C — F is {x{H) — l)-partite. We may and shall assume that e < 1/66 and 
65e 2 < 7 3 . 

Let X be the set of H-fcee graphs on vertex set [N]. For I G X let T = Tj, C = C(T) 
and d = c(H, e) be given by Theorem 11.31 with e as above. Let 

Zx = (l G X : e{C{Ti)) > ( 1 - .J. - - lie" ( A 



x(^)-i y V2 



X-Xi 



For 7 G Ji let F = F(T/) C C(T/) be as above, so that C(T/) is (x(#) - l)-partite. 

Let X = G(N,p). Let E\ be the event that there exists J C X, I G X\ such that 
\F(Ti) n X\ > 7p(^)- Let F 2 be the event that there exists I C X, I G X 2 such that 
|C(T» n X| > (1 - x(g 1 ) _ 1 - e)p{ N 2 ). Observe that F C F x U E 2 . 

The probability of E 2 is bounded by applying Lemma 112.41 to the collection X 2 , with 
M = [N]W, n={ N 2 ), D(Tj) = C(Tj), u = 10c, d = (1 - - lle)(^) > §(*), s = d 

and t = d N 2 ~ l l m ( H } ; provided p > ciV~ 1 / m ^) and c,N are sufficiently large, 

P(F 2 ) < exp {-z/V/32} < exp j-25e 2 pQ^ /24 

The probability of E\ is bounded by applying Lemma 112.41 to the collection X±, with 
D(Tj) = F(Tj), u = j,d= (7/2) (J), s = c' and i = c'ff 2 " 1 /" 1 ^); provided p > cAr-V^W 
and c, A" are sufficiently large, 

P(.Ei) < exp{-i/ 2 pci/32} = exp / —pj 3 

("2") 7/64 } ~ 6XP { ~ 65pe2 ("2) Z 64 } ' 
Since P(Fo) < P(Fi)+P(F2) and is large, this completes the proof of Theorem ll2.H □ 

Proof of Theorem [WE. Let N = hn, let p = M/n 2 and let X = G(n,M,H). For ease of 
notation we shall often identify graphs with their edge sets. Let v = v(H, a), rj = r](H, v) 
and e = e(H, rj) be sufficiently small constants to be chosen later. Let C be given by 
Theorem O applied with H and e. For T = (T\, ... , T s ), let J(T) = T 1 U---UT S , and 
define the following probabilistic events: 

E T : J(T) C X C C(T) and AT is (H, r/,p)-regular, 
F T : J(T) C X, 

G T : AT C C(T) and X - J{T) is 2r ? ,p)-regular. 

Theorem 11.31 tells us that if X is H-free then there exists T = (T\, . . . ,T S ) with J(T) C 
X C C(T), and if in addition X is (-ff, r/,p)-regular then Et holds. By the union bound, it 
is hence sufficient to show that X^t^F/t) < a M . 

Theorem O also states that s < d and \ J(T)\ < dN 2 ~ l l m{ * H \ where d = d(e,H), so it 
is easily checked that if X is (H, r],p) -regular then X — J(T) is (H, 2ry,p) -regular provided 
that p > {d /rf)^ 1 /™^. This holds provided M > cn 2 " 1 /"^ for c sufficiently large, 
which we shall assume. Therefore E T C F T n G T and so P(F T ) < P(F T )P(G T |F T ). 

In order to bound P(G T |F T ), let T be fixed, let J = J(T) and let C = C{T) G C. Consider 
an 77-regular Szemeredi partition of C refining the partition V\, . . . , Vh, i.e., for r sufficiently 
large depending only on r/, and for each i G [/i] an equitable partition Vi = V^i U • • • U Vi^ r 
so that all but at most r]r 2 e(H) of the bipartite graphs between pairs (Vi X , Vj jV ) with 
{i,j} G E{H) are 77-regular. 
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Provided r\ and e are sufficiently small then, by standard arguments, for every choice 
of xi,...,Xh G [r] there exists some pair {i,j} G E(H) such that the pair (V^ Xi , V^) is 
either irregular or has density dc(Vi^ Xi ,Vj tXj ) < v (since otherwise there would be more 
than eN h copies of H in C, contradicting the conditions of Theorem 1 1.3[) . In particular, by 
averaging, there exists {io,jo} G E(H) such that at least r 2 /e(H) of the pairs (Vi 0lX , Vj 0! y) 
with x, y G [r] are either irregular or of density less than v. Let 

R = {(Vio,x,Vj 0) y) : x,y G [r] and dcO^,*, Vf ,i/) < f} 3 
S 1 = {{u, w} : u £ U, w £ W, for some (E7, W) G 

Since there are at most r]r 2 e(H) irregular pairs (Vi Q ,x,Vj ,y) with x,y G [r], this implies 
that > r 2 /2e(H) for 77 sufficiently small. Allowing for the sets not being all 
exactly the same size, this implies that | *S' | > n 2 /4e(H). Note that, by the definition of R, 
\SnC\ < u\S\ < vn 2 holds. 

Write P(i,j) for the set of pairs of vertices {{u,v} : u G V^v G Vj}. If Gt holds, then 
X — J is 2r/,p)-regular and so 

(8) \Sn (X - J)\ > (1 - 2»j)p|5| > M/8e(H). 

However |5 n C\ < vn 2 and if Gt holds then X C C, so the probability of ([8]) is small. 
Specifically, in generating the random graph {X— J)nP(io,jo) when conditioned on J C X, 
we are selecting a set of M — \ J n P(io,io)| < edges uniformly from at least n 2 — \ J n 
(*o? Jo) I > ^ 2 /2 possible edges, and for flSJ) to hold, we must select at least M/8e(H) edges 
from a set of vn 2 possibilities. This probability is at most 

provided f = u(H,a) is small enough. 

Thus £ T P(Er) < Er IP ( F T)F(GT|i ; T) < (a/2) M Et^t), and to complete the proof 
it is enough to show J2 T F(F T ) < 2 M . Now 

^P(f T ) = ^P(J(T)cI) = ^El J(T)cX = E|{T : J(T)d}|. 

T T T 

Since T = (Ti,...,T s ) where s < c', Lemma [5.11 tells us that \{T : J(T) C X}\ < 
exp{(/6\X\(l + log(l/0))} where 9\X\ is the average size of the Tj. Now |X| = Me(H) > 
ce{H)n 2 ~ l l m ( H } and < c'n 2 ~ l / m ^ H \ We can make as small as we wish to by making 
c large, thus ensuring \{T : J{T) C X}\ < 2 M , as desired. □ 

Proof of Theorem \12.'A Repeat the proof of Theorem 11.111 with M = F and use Theo- 
rem [HO] instead of Theorem 11.31 for the collection C. □ 

Proof of Theorem \1.1SX Let ([N], A, b, Z) be the (£ — 2) x £ linear system corresponding to 
forbidding an £-teim arithmetic progression in [N]. For example if £ = 3 then A = (1, 1, —2), 
b = (0) and Z is the set of solutions of the form x + x — 2x = that are discounted, so 
\Z\ = N. It can readily be checked that mrm(A) =1—1. Since ex(F, A, b) = 0, the result 
immediately follows by applying Theorem 112.31 □ 
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